The human gut microbiome plays a critical role in maintaining host health and homeostasis, and current literature suggests a bidirectional relationship between microbiome ecology and host well-being. DNA metabarcoding has emerged as a powerful tool for investigating microbiome imbalances (i.e., dysbiosis). While the prokaryotic microbiome has been extensively studied, the fungal counterpart - or mycobiome - remains largely unexplored, despite its recognized role from the perinatal stage onward. Here, we present a comprehensive survey based on DNA metabarcoding analysis of approximately 1,500 publicly available ITS1 samples. This survey integrates conventional statistical approaches with Machine Learning (ML) methods coupled with explainable Artificial Intelligence (XAI). ML models successfully predicted host health status with accuracies exceeding 80%, and fungal genera such as Eurotium, Aureobasidium, Candida, and Cutaneotrichosporon emerged as key classification features. This study introduces a cutting-edge multiview analytical framework applied to publicly available mycobiome data, highlighting the potential of fungal community profiling as a non-invasive tool to support health diagnostics.

Machine learning-based assessment of the healthy human gut mycobiota landscape using ITS1 DNA metabarcoding data

Defazio, Giuseppe;De Robertis, Mariangela;Pesole, Graziano
;
Fosso, Bruno
2026-01-01

Abstract

The human gut microbiome plays a critical role in maintaining host health and homeostasis, and current literature suggests a bidirectional relationship between microbiome ecology and host well-being. DNA metabarcoding has emerged as a powerful tool for investigating microbiome imbalances (i.e., dysbiosis). While the prokaryotic microbiome has been extensively studied, the fungal counterpart - or mycobiome - remains largely unexplored, despite its recognized role from the perinatal stage onward. Here, we present a comprehensive survey based on DNA metabarcoding analysis of approximately 1,500 publicly available ITS1 samples. This survey integrates conventional statistical approaches with Machine Learning (ML) methods coupled with explainable Artificial Intelligence (XAI). ML models successfully predicted host health status with accuracies exceeding 80%, and fungal genera such as Eurotium, Aureobasidium, Candida, and Cutaneotrichosporon emerged as key classification features. This study introduces a cutting-edge multiview analytical framework applied to publicly available mycobiome data, highlighting the potential of fungal community profiling as a non-invasive tool to support health diagnostics.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11586/585001
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? 1
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 0
social impact