This paper presents a novel Topological Machine Learning (TML) framework aimed at improving the classification of lung lesions in CT scans. The approach integrates topological data analysis with machine learning, leveraging persistent homology to derive a set of robust topological descriptors–including functional, vector-based, and image-based features. These descriptors represent the intrinsic shape and structure of lung lesions at multiple scales and, once properly converted into numerical feature vectors, are suitable for use in various classification algorithms. The framework is evaluated on the publicly available IQ-OTH/NCCD dataset, showing high classification accuracy and consistent performance across lesion types. These results demonstrate the effectiveness of TML –and topology more broadly–in extracting meaningful patterns from complex medical imaging data while maintaining interpretability and data efficiency. The proposed methodology offers a promising alternative to conventional radiomics or deep learning methods, especially in scenarios where model transparency, limited training data, and generalization are critical for clinical decision-making and diagnostics.

Uncovering Lung Lesion Patterns in Computed Tomography Scans Through Topological Machine Learning

Serena Grazia De Benedictis
;
Nicoletta Del Buono
2026-01-01

Abstract

This paper presents a novel Topological Machine Learning (TML) framework aimed at improving the classification of lung lesions in CT scans. The approach integrates topological data analysis with machine learning, leveraging persistent homology to derive a set of robust topological descriptors–including functional, vector-based, and image-based features. These descriptors represent the intrinsic shape and structure of lung lesions at multiple scales and, once properly converted into numerical feature vectors, are suitable for use in various classification algorithms. The framework is evaluated on the publicly available IQ-OTH/NCCD dataset, showing high classification accuracy and consistent performance across lesion types. These results demonstrate the effectiveness of TML –and topology more broadly–in extracting meaningful patterns from complex medical imaging data while maintaining interpretability and data efficiency. The proposed methodology offers a promising alternative to conventional radiomics or deep learning methods, especially in scenarios where model transparency, limited training data, and generalization are critical for clinical decision-making and diagnostics.
2026
978-3-032-11381-8
978-3-032-11380-1
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11586/564080
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact