A comprehensive analysis of the Italian school system using harmonised open data via the SchoolDataIT R package

Cefalo L. (Writing – Original Draft Preparation); 2025-01-01

Abstract

We present the SchoolDataIT R library, which provides an overview of the current state of the Italian educational system by gathering relevant open data on school infrastructure through web scraping and harmonising them into a coherent database. In addition to infrastructural information, the software retrieves the results of the Invalsi census survey, which is widely regarded as a thorough indicator of education quality nationwide. The package comprises four main groups of functions: the first retrieves the inputs from the source web pages; the second performs basic data editing; the third aggregates the data at a given territorial level, either municipalities (LAU) or provinces (NUTS-3); and the fourth renders the final datasets as static or interactive maps. We illustrate a potential application of the software with a practical example that highlights the importance of spatial statistics for modelling data on the educational system at the territorial level. Indeed, territorial disparities emerge across several dimensions of both infrastructure endowment and education quality, and they represent a significant challenge to territorial sustainability.
Use this identifier to cite or link to this document: https://hdl.handle.net/11586/550581