At the beginning of 2020, Italy was the country with the highest number of COVID‑19 cases, not only in Europe, but also in the rest of the world, and Lombardy was the most heavily hit region of Italy. The objective of this research is to understand which variables have determined the prevalence of cases in Lombardy and in other highly‑affected European regions. We consider the first and second waves of the COVID‑19 pandemic, using a set of 22 variables related to economy, population, healthcare and education. Regions with a high prevalence of cases are extracted by means of binary classifiers, then the most relevant variables for the classification are determined, and the robustness of the analysis is assessed. Our results show that the most meaningful features to identify high‑prevalence regions include high number of hours spent in work environments, high life expectancy, and low number of people leaving from education and neither employed nor educated or trained.
Social and economic variables explain COVID-19 diffusion in European regions / Cancedda, Christian; Cappellato, Alessio; Maninchedda, Luigi; Meacci, Leonardo; Peracchi, Sofia; Salerni, Claudia; Baralis, Elena; Giobergia, Flavio; Ceri, Stefano. - In: SCIENTIFIC REPORTS. - ISSN 2045-2322. - 14:1(2024). [10.1038/s41598-024-58218-0]
Social and economic variables explain COVID-19 diffusion in European regions
baralis, elena;giobergia, flavio;
2024
Abstract
At the beginning of 2020, Italy was the country with the highest number of COVID‑19 cases, not only in Europe, but also in the rest of the world, and Lombardy was the most heavily hit region of Italy. The objective of this research is to understand which variables have determined the prevalence of cases in Lombardy and in other highly‑affected European regions. We consider the first and second waves of the COVID‑19 pandemic, using a set of 22 variables related to economy, population, healthcare and education. Regions with a high prevalence of cases are extracted by means of binary classifiers, then the most relevant variables for the classification are determined, and the robustness of the analysis is assessed. Our results show that the most meaningful features to identify high‑prevalence regions include high number of hours spent in work environments, high life expectancy, and low number of people leaving from education and neither employed nor educated or trained.File | Dimensione | Formato | |
---|---|---|---|
s41598-024-56267-z.pdf
accesso aperto
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Creative commons
Dimensione
5.17 MB
Formato
Adobe PDF
|
5.17 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2988246