At the beginning of 2020, Italy was the country with the highest number of COVID‑19 cases, not only in Europe, but also in the rest of the world, and Lombardy was the most heavily hit region of Italy. The objective of this research is to understand which variables have determined the prevalence of cases in Lombardy and in other highly‑affected European regions. We consider the first and second waves of the COVID‑19 pandemic, using a set of 22 variables related to economy, population, healthcare and education. Regions with a high prevalence of cases are extracted by means of binary classifiers, then the most relevant variables for the classification are determined, and the robustness of the analysis is assessed. Our results show that the most meaningful features to identify high‑prevalence regions include high number of hours spent in work environments, high life expectancy, and low number of people leaving from education and neither employed nor educated or trained.

Social and economic variables explain COVID-19 diffusion in European regions / Cancedda, Christian; Cappellato, Alessio; Maninchedda, Luigi; Meacci, Leonardo; Peracchi, Sofia; Salerni, Claudia; Baralis, Elena; Giobergia, Flavio; Ceri, Stefano. - In: SCIENTIFIC REPORTS. - ISSN 2045-2322. - 14:1(2024). [10.1038/s41598-024-58218-0]

Social and economic variables explain COVID-19 diffusion in European regions

baralis, elena;giobergia, flavio;
2024

Abstract

At the beginning of 2020, Italy was the country with the highest number of COVID‑19 cases, not only in Europe, but also in the rest of the world, and Lombardy was the most heavily hit region of Italy. The objective of this research is to understand which variables have determined the prevalence of cases in Lombardy and in other highly‑affected European regions. We consider the first and second waves of the COVID‑19 pandemic, using a set of 22 variables related to economy, population, healthcare and education. Regions with a high prevalence of cases are extracted by means of binary classifiers, then the most relevant variables for the classification are determined, and the robustness of the analysis is assessed. Our results show that the most meaningful features to identify high‑prevalence regions include high number of hours spent in work environments, high life expectancy, and low number of people leaving from education and neither employed nor educated or trained.
File in questo prodotto:
File Dimensione Formato  
s41598-024-56267-z.pdf

accesso aperto

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Creative commons
Dimensione 5.17 MB
Formato Adobe PDF
5.17 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2988246