This paper describes the participation of a team from the University of Bari in the Decluttering Challenge organized in the scope of the DocGen2 workshop. We propose a supervised approach relying on a minimal set of non-textual features (length, overlapping between the comment text and the source code, code block type, tags, comment type) and classical textual features (bag-of-words). Our system ranked 2nd in the documentation decluttering task.

Leveraging Textual and Non-Textual Features for Documentation Decluttering

Basile P.;Novielli N.
2020-01-01

Abstract

This paper describes the participation of a team from the University of Bari in the Decluttering Challenge organized in the scope of the DocGen2 workshop. We propose a supervised approach relying on a minimal set of non-textual features (length, overlapping between the comment text and the source code, code block type, tags, comment type) and classical textual features (bag-of-words). Our system ranked 2nd in the documentation decluttering task.
2020
978-1-7281-5619-4
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11586/348734
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact