The human gut microbiome plays a critical role in maintaining host health and homeostasis, and current literature suggests a bidirectional relationship between microbiome ecology and host well-being. DNA metabarcoding has emerged as a powerful tool for investigating microbiome imbalances (i.e., dysbiosis). While the prokaryotic microbiome has been extensively studied, the fungal counterpart - or mycobiome - remains largely unexplored, despite its recognized role from the perinatal stage onward. Here, we present a comprehensive survey based on DNA metabarcoding analysis of approximately 1,500 publicly available ITS1 samples. This survey integrates conventional statistical approaches with Machine Learning (ML) methods coupled with explainable Artificial Intelligence (XAI). ML models successfully predicted host health status with accuracies exceeding 80%, and fungal genera such as Eurotium, Aureobasidium, Candida, and Cutaneotrichosporon emerged as key classification features. This study introduces a cutting-edge multiview analytical framework applied to publicly available mycobiome data, highlighting the potential of fungal community profiling as a non-invasive tool to support health diagnostics.
Machine learning-based assessment of the healthy human gut mycobiota landscape using ITS1 DNA metabarcoding data
Defazio, Giuseppe;De Robertis, Mariangela;Pesole, Graziano
;Fosso, Bruno
2026-01-01
Abstract
The human gut microbiome plays a critical role in maintaining host health and homeostasis, and current literature suggests a bidirectional relationship between microbiome ecology and host well-being. DNA metabarcoding has emerged as a powerful tool for investigating microbiome imbalances (i.e., dysbiosis). While the prokaryotic microbiome has been extensively studied, the fungal counterpart - or mycobiome - remains largely unexplored, despite its recognized role from the perinatal stage onward. Here, we present a comprehensive survey based on DNA metabarcoding analysis of approximately 1,500 publicly available ITS1 samples. This survey integrates conventional statistical approaches with Machine Learning (ML) methods coupled with explainable Artificial Intelligence (XAI). ML models successfully predicted host health status with accuracies exceeding 80%, and fungal genera such as Eurotium, Aureobasidium, Candida, and Cutaneotrichosporon emerged as key classification features. This study introduces a cutting-edge multiview analytical framework applied to publicly available mycobiome data, highlighting the potential of fungal community profiling as a non-invasive tool to support health diagnostics.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


