Several cities have built on-the-ground air quality monitoring stations to measure daily concentration of air pollutants, like PM10 and NO2. The identification of the causalities for air pollution will help governments' decision-making on mitigating air pollution and on prioritizing recommendations. This paper presents a two-level methodology based on unsupervised analytics methods, named PANDA, to discover interesting insights from air quality-related data. First, PANDA discovers groups of pollutants that have occurred with similar concentrations. Then, each cluster is locally characterized through three forms of human-readable knowledge to provide interesting correlations between air pollution and meteorological conditions at different abstraction level. As a case study, PANDA has been validated on real pollutant measurements collected in a major Italian city. Preliminary experimental results show that PANDA is effective in discovering cohesive and well-separated groups of similar concentrations of pollutants along with different forms of interpretable correlations among air pollution and weather data.
Characterizing Air-Quality Data Through Unsupervised Analytics Methods / Daraio, Elena; DI CORSO, Evelina; Cerquitelli, Tania; Chiusano, SILVIA ANNA. - ELETTRONICO. - (2018), pp. 205-217. (Intervento presentato al convegno New Trends in Databases and Information Systems tenutosi a Budapest, Hungary nel September, 2-5, 2018) [10.1007/978-3-030-00063-9_20].
Characterizing Air-Quality Data Through Unsupervised Analytics Methods
DARAIO, ELENA;Evelina Di Corso;Tania Cerquitelli;Silvia Chiusano
2018
Abstract
Several cities have built on-the-ground air quality monitoring stations to measure daily concentration of air pollutants, like PM10 and NO2. The identification of the causalities for air pollution will help governments' decision-making on mitigating air pollution and on prioritizing recommendations. This paper presents a two-level methodology based on unsupervised analytics methods, named PANDA, to discover interesting insights from air quality-related data. First, PANDA discovers groups of pollutants that have occurred with similar concentrations. Then, each cluster is locally characterized through three forms of human-readable knowledge to provide interesting correlations between air pollution and meteorological conditions at different abstraction level. As a case study, PANDA has been validated on real pollutant measurements collected in a major Italian city. Preliminary experimental results show that PANDA is effective in discovering cohesive and well-separated groups of similar concentrations of pollutants along with different forms of interpretable correlations among air pollution and weather data.Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2713147
Attenzione
Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo