Leveraging Textual and Non-Textual Features for Documentation Decluttering
Basile P.;Novielli N.
2020-01-01
Abstract
This paper describes the participation of a team from the University of Bari in the Decluttering Challenge, organized as part of the DocGen2 workshop. We propose a supervised approach that relies on a minimal set of non-textual features (length, overlap between the comment text and the source code, code block type, tags, comment type) and classical textual features (bag-of-words). Our system ranked 2nd in the documentation decluttering task.
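
To make the feature set named in the abstract concrete, the following is a minimal sketch of how such features might be computed for a single comment. It is not the authors' implementation: the function names, the tag pattern, the camelCase splitting, and the definition of overlap (the share of comment tokens that also occur in the code) are all assumptions made for illustration, and the supervised classifier itself is omitted.

```python
import re
from collections import Counter

# Assumed tag pattern: Javadoc-style tags (@param, @return) and TODO/FIXME markers.
TAG_PATTERN = re.compile(r"@\w+|\b(?:TODO|FIXME)\b")
TOKEN_PATTERN = re.compile(r"[A-Za-z]+")


def tokenize(text):
    """Return lowercase alphabetic tokens, splitting camelCase identifiers first."""
    spaced = re.sub(r"(?<=[a-z])(?=[A-Z])", " ", text)
    return [token.lower() for token in TOKEN_PATTERN.findall(spaced)]


def extract_features(comment, code, block_type, comment_type):
    """Build a feature dictionary for one comment (hypothetical helper).

    block_type and comment_type are passed in as plain labels; how they are
    derived from the source file is outside the scope of this sketch.
    """
    comment_tokens = tokenize(comment)
    code_tokens = set(tokenize(code))
    shared = [token for token in comment_tokens if token in code_tokens]
    # Assumed overlap measure: fraction of comment tokens found in the code.
    overlap = len(shared) / len(comment_tokens) if comment_tokens else 0.0
    return {
        "length": len(comment_tokens),
        "overlap": overlap,
        "block_type": block_type,
        "comment_type": comment_type,
        "has_tag": bool(TAG_PATTERN.search(comment)),
        "bag_of_words": Counter(comment_tokens),
    }


features = extract_features(
    "Returns the user name",
    "public String getUserName() { return userName; }",
    block_type="method",
    comment_type="line",
)
```

In a full pipeline, dictionaries like `features` would be vectorized and passed to a supervised classifier; the choice of classifier is not stated in the abstract and is left open here.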