Several cities have built on-the-ground air quality monitoring stations to measure daily concentration of air pollutants, like PM10 and NO2. The identification of the causalities for air pollution will help governments' decision-making on mitigating air pollution and on prioritizing recommendations. This paper presents a two-level methodology based on unsupervised analytics methods, named PANDA, to discover interesting insights from air quality-related data. First, PANDA discovers groups of pollutants that have occurred with similar concentrations. Then, each cluster is locally characterized through three forms of human-readable knowledge to provide interesting correlations between air pollution and meteorological conditions at different abstraction level. As a case study, PANDA has been validated on real pollutant measurements collected in a major Italian city. Preliminary experimental results show that PANDA is effective in discovering cohesive and well-separated groups of similar concentrations of pollutants along with different forms of interpretable correlations among air pollution and weather data.

Characterizing Air-Quality Data Through Unsupervised Analytics Methods / Daraio, Elena; DI CORSO, Evelina; Cerquitelli, Tania; Chiusano, SILVIA ANNA. - ELETTRONICO. - (2018), pp. 205-217. (Intervento presentato al convegno New Trends in Databases and Information Systems tenutosi a Budapest, Hungary nel September, 2-5, 2018) [10.1007/978-3-030-00063-9_20].

Characterizing Air-Quality Data Through Unsupervised Analytics Methods

DARAIO, ELENA;Evelina Di Corso;Tania Cerquitelli;Silvia Chiusano
2018

Abstract

Several cities have built on-the-ground air quality monitoring stations to measure daily concentration of air pollutants, like PM10 and NO2. The identification of the causalities for air pollution will help governments' decision-making on mitigating air pollution and on prioritizing recommendations. This paper presents a two-level methodology based on unsupervised analytics methods, named PANDA, to discover interesting insights from air quality-related data. First, PANDA discovers groups of pollutants that have occurred with similar concentrations. Then, each cluster is locally characterized through three forms of human-readable knowledge to provide interesting correlations between air pollution and meteorological conditions at different abstraction level. As a case study, PANDA has been validated on real pollutant measurements collected in a major Italian city. Preliminary experimental results show that PANDA is effective in discovering cohesive and well-separated groups of similar concentrations of pollutants along with different forms of interpretable correlations among air pollution and weather data.
2018
978-3-030-00062-2
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2713147
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo