Large volumes of data are being produced by modern applications at an ever increasing rate. The automatic analysis of such huge data volume is a challenging task since a large amount of interesting knowledge can be extracted. Association rule mining allows discovering interesting and hidden correlations among data. Since this mining process is characterized by computationally intensive tasks, efficient distributed approaches are needed to increase its scalability. This paper proposes a novel cloud-based service, named SeARuM, to efficiently mine association rules on a distributed computing model. SeARuM consists of a series of distributed MapReduce jobs run in the cloud. Each job performs a different step in the association rule mining process. As a case study, the proposed approach has been applied to the network data scenario. The experimental validation, performed on two real network datasets, shows the effectiveness and the efficiency of SeARuM in mining association rules on a distributed computing model.
Association Rule Mining as a Cloud-Based Service / Apiletti, D.; Baralis, ELENA MARIA; Cerquitelli, Tania; Chiusano, SILVIA ANNA; Grimaudo, Luigi. - STAMPA. - (2013), pp. 461-468. (Intervento presentato al convegno 21st Italian Symposium on Advanced Database Systems tenutosi a Roccella Jonica, Reggio Calabria, Italy nel June 30th – July 04th, 2013).
Association Rule Mining as a Cloud-Based Service
Apiletti D.;BARALIS, ELENA MARIA;CERQUITELLI, TANIA;CHIUSANO, SILVIA ANNA;GRIMAUDO, LUIGI
2013
Abstract
Large volumes of data are being produced by modern applications at an ever increasing rate. The automatic analysis of such huge data volume is a challenging task since a large amount of interesting knowledge can be extracted. Association rule mining allows discovering interesting and hidden correlations among data. Since this mining process is characterized by computationally intensive tasks, efficient distributed approaches are needed to increase its scalability. This paper proposes a novel cloud-based service, named SeARuM, to efficiently mine association rules on a distributed computing model. SeARuM consists of a series of distributed MapReduce jobs run in the cloud. Each job performs a different step in the association rule mining process. As a case study, the proposed approach has been applied to the network data scenario. The experimental validation, performed on two real network datasets, shows the effectiveness and the efficiency of SeARuM in mining association rules on a distributed computing model.Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2518921
Attenzione
Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo