PaWI: Parallel Weighted Itemset Mining by means of MapReduce

Baralis, Elena Maria; Cagliero, Luca; Garza, Paolo; Grimaudo, Luigi

doi:10.1109/BigDataCongress.2015.14

Frequent itemset mining is an exploratory data mining technique that has fruitfully been exploited to extract recurrent co-occurrences between data items. Since in many application contexts items are enriched with weights denoting their relative importance in the analyzed data, pushing item weights into the itemset mining process, i.e., mining weighted itemsets rather than traditional itemsets, is an appealing research direction. Although many efficient in-memory weighted itemset mining algorithms are available in literature, there is a lack of parallel and distributed solutions which are able to scale towards Big Weighted Data. This paper presents a scalable frequent weighted itemset mining algorithm based on the MapReduce paradigm. To demonstrate its actionability and scalability, the proposed algorithm was tested on a real Big dataset collecting approximately 34 millions of reviews of Amazon items. Weights indicate the ratings given by users to the purchased items. The mined itemsets represent combinations of items that were frequently bought together with an overall rating above average.

PaWI: Parallel Weighted Itemset Mining by means of MapReduce / Baralis, E.M., Cagliero, L., Garza, P., Grimaudo, L.. - STAMPA. - Proceedings of the 2015 IEEE International Congress on Big Data:(2015), pp. 25-32. (2015 IEEE International Congress on Big Data New York (USA) 26-30 giugno 2015) [10.1109/BigDataCongress.2015.14].

PaWI: Parallel Weighted Itemset Mining by means of MapReduce

BARALIS, ELENA MARIA;CAGLIERO, LUCA;GARZA, PAOLO;GRIMAUDO, LUIGI

2015

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2015
			
	Codice ISBN
	
				978-1-4673-7278-7
			
	Appare nelle tipologie
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
PaWI.pdf accesso aperto Descrizione: Draft Tipologia: 2. Post-print / Author's Accepted Manuscript Licenza: Pubblico - Tutti i diritti riservati Dimensione 111.71 kB Formato Adobe PDF Visualizza/Apri	111.71 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2639847

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

PORTO @ Archivio Istituzionale della Ricerca

PaWI: Parallel Weighted Itemset Mining by means of MapReduce

BARALIS, ELENA MARIA;CAGLIERO, LUCA;GARZA, PAOLO;GRIMAUDO, LUIGI

2015

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Attenzione

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)