The awareness that socioeconomic factors play a significant role in the potential onset of cancer is increasingly widespread; however, a clear understanding of the most influential factors is still lacking. In this study, we explore the relationship between cancer incidence, as recorded by the International Agency for Research on Cancer, and the environmental and socioeconomic well-being of countries, measured by the Sustainable Development Goals (SDGs) indicators. To identify relevant predictors of cancer incidence, we construct a weighted complex network where nodes represent SDG indicators, and links correspond to statistically significant correlations between them. We implement community detection to identify a subset of indicators that incorporates the non-redundant dataset's information, and use the selected features for a machine learning prediction of cancer incidence rates. Furthermore, we highlight the most influential SDG indicators by means of an eXplainable Artificial Intelligence analysis. We find that not only health-related indicators play a key role in explaining cancer incidence, but also factors related to agriculture, resource availability, and water cleanliness. These findings provide insights into the complex interplay between socioeconomic, environmental, and health factors. This study aims to expand knowledge on non-intuitive associations related to cancer onset and may contribute to the development of effective public prevention policies.

Exploring the relationship between cancer incidence and the sustainable development goals through complex networks and machine learning

Andrea Lo Sasso;Loredana Bellantuono
;
2025-01-01

Abstract

The awareness that socioeconomic factors play a significant role in the potential onset of cancer is increasingly widespread; however, a clear understanding of the most influential factors is still lacking. In this study, we explore the relationship between cancer incidence, as recorded by the International Agency for Research on Cancer, and the environmental and socioeconomic well-being of countries, measured by the Sustainable Development Goals (SDGs) indicators. To identify relevant predictors of cancer incidence, we construct a weighted complex network where nodes represent SDG indicators, and links correspond to statistically significant correlations between them. We implement community detection to identify a subset of indicators that incorporates the non-redundant dataset's information, and use the selected features for a machine learning prediction of cancer incidence rates. Furthermore, we highlight the most influential SDG indicators by means of an eXplainable Artificial Intelligence analysis. We find that not only health-related indicators play a key role in explaining cancer incidence, but also factors related to agriculture, resource availability, and water cleanliness. These findings provide insights into the complex interplay between socioeconomic, environmental, and health factors. This study aims to expand knowledge on non-intuitive associations related to cancer onset and may contribute to the development of effective public prevention policies.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11586/556280
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 0
social impact