Data scarcity remains a major obstacle to the application of deep learning techniques in medical imaging, particularly for rare neurodegenerative diseases. This study investigates the use of denoising diffusion probabilistic models (DDPMs) to generate synthetic 3D T1-weighted brain MRI images in this context. Addressing the dual challenges of limited training data and structural fidelity, we propose a generative pipeline trained on a multicenter dataset of healthy subjects. The model suggests the potential to produce anatomically coherent synthetic scans with realistic variability. Quantitative evaluation based on Maximum Mean Discrepancy confirms the similarity between real and generated data distributions, while visual assessments highlight the preservation of global and local brain structures. Despite limitations in high-frequency detail reconstruction, the results suggest that DDPMs hold promise as a tool for augmenting neuroimaging datasets and supporting downstream tasks such as classification and segmentation. This work lays the foundation for future research aimed at improving resolution and adapting generative models to the specific challenges of rare disease imaging.
Diffusion Models for Neuroimaging Data Augmentation: Assessing Realism and Clinical Relevance
Mallardi, Giulio
Conceptualization
;Calefato, Fabio;Lanubile, Filippo;Logroscino, Giancarlo;Tafuri, Benedetta
2025-01-01
Abstract
Data scarcity remains a major obstacle to the application of deep learning techniques in medical imaging, particularly for rare neurodegenerative diseases. This study investigates the use of denoising diffusion probabilistic models (DDPMs) to generate synthetic 3D T1-weighted brain MRI images in this context. Addressing the dual challenges of limited training data and structural fidelity, we propose a generative pipeline trained on a multicenter dataset of healthy subjects. The model suggests the potential to produce anatomically coherent synthetic scans with realistic variability. Quantitative evaluation based on Maximum Mean Discrepancy confirms the similarity between real and generated data distributions, while visual assessments highlight the preservation of global and local brain structures. Despite limitations in high-frequency detail reconstruction, the results suggest that DDPMs hold promise as a tool for augmenting neuroimaging datasets and supporting downstream tasks such as classification and segmentation. This work lays the foundation for future research aimed at improving resolution and adapting generative models to the specific challenges of rare disease imaging.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


