The clinical adenoma–carcinoma progression represents a well-established framework for understanding colorectal cancer (CRC) development, although the molecular mechanisms underlying this transition remain only partially understood. Increasing evidence suggests the gut microbiome (GM) as a key modulator of colorectal carcinogenesis, positioning microbial profiling as a promising avenue for noninvasive risk stratification and early detection. In this study, Machine Learning (ML) classifiers integrated with eXplainable Artificial Intelligence (XAI) techniques were employed to identify microbiome-derived biomarkers predictive of CRC and adenomatous lesions. The models were trained on 16S rRNA sequencing data from 453 patients and evaluated through cross-validation, achieving AU-ROC and AU-PRC scores of 0.71 and 0.67, respectively. External validation on an independent Italian cohort ((Formula presented.)) yielded AU-ROC and AU-PRC scores of 0.70 and 0.89, respectively. XAI-based interpretation revealed consistent microbial signatures across datasets. In detail, taxa belonging to the Fusobacterium and Peptostreptococcus genera were associated with increased CRC risk, whereas the Eubacterium eligens group was identified as a robust negative predictor. Beyond classification, patient-level explanations enabled by XAI facilitated the identification of adenoma subgroups exhibiting microbiome profiles converging toward those of CRC, suggesting the presence of transitional microbial states. Moreover, SHAP-based interaction networks uncovered microbial hubs and inter-species dependencies characterizing high-risk configurations, providing insights into the ecological dynamics of colorectal tumorigenesis. These findings demonstrate the added XAI value in elucidating microbiome interactions, enhancing model interpretability, and supporting biologically informed hypotheses. This integrative, explainable framework highlights the potential of AI-driven microbiome analysis in precision oncology and advances the development of interpretable, noninvasive tools for CRC risk prediction and management.

Personalized colorectal cancer risk assessment through explainable AI and Gut microbiome profiling

Novielli, Pierfrancesco;Romano, Donato;Magarelli, Michele;Diacono, Domenico;Di Bitonto, Pierpaolo;Bellotti, Roberto;Tangaro, Sabina
2025-01-01

Abstract

The clinical adenoma–carcinoma progression represents a well-established framework for understanding colorectal cancer (CRC) development, although the molecular mechanisms underlying this transition remain only partially understood. Increasing evidence suggests the gut microbiome (GM) as a key modulator of colorectal carcinogenesis, positioning microbial profiling as a promising avenue for noninvasive risk stratification and early detection. In this study, Machine Learning (ML) classifiers integrated with eXplainable Artificial Intelligence (XAI) techniques were employed to identify microbiome-derived biomarkers predictive of CRC and adenomatous lesions. The models were trained on 16S rRNA sequencing data from 453 patients and evaluated through cross-validation, achieving AU-ROC and AU-PRC scores of 0.71 and 0.67, respectively. External validation on an independent Italian cohort ((Formula presented.)) yielded AU-ROC and AU-PRC scores of 0.70 and 0.89, respectively. XAI-based interpretation revealed consistent microbial signatures across datasets. In detail, taxa belonging to the Fusobacterium and Peptostreptococcus genera were associated with increased CRC risk, whereas the Eubacterium eligens group was identified as a robust negative predictor. Beyond classification, patient-level explanations enabled by XAI facilitated the identification of adenoma subgroups exhibiting microbiome profiles converging toward those of CRC, suggesting the presence of transitional microbial states. Moreover, SHAP-based interaction networks uncovered microbial hubs and inter-species dependencies characterizing high-risk configurations, providing insights into the ecological dynamics of colorectal tumorigenesis. These findings demonstrate the added XAI value in elucidating microbiome interactions, enhancing model interpretability, and supporting biologically informed hypotheses. This integrative, explainable framework highlights the potential of AI-driven microbiome analysis in precision oncology and advances the development of interpretable, noninvasive tools for CRC risk prediction and management.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11586/552183
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact