Expressive generalized itemsets

Baralis, Elena Maria; Cagliero, Luca; Cerquitelli, Tania; D’Elia, V.; Garza, Paolo

doi:10.1016/j.ins.2014.03.056

Generalized itemset mining is a powerful tool for discovering multiple-level correlations among the analyzed data. A taxonomy is used to aggregate data items into higher-level concepts and to discover frequent recurrences among data items at different granularity levels. However, since traditional high-level itemsets may also represent the knowledge covered by their lower-level frequent descendant itemsets, the expressiveness of high-level itemsets can be rather limited. To overcome this issue, this article proposes two novel itemset types, called Expressive Generalized Itemset (EGI) and Maximal Expressive Generalized Itemset (Max-EGI), in which the frequency of occurrence of a high-level itemset is evaluated only on the portion of data not yet covered by any of its frequent descendants. Specifically, EGIs represent, at a high level of abstraction, the knowledge associated with sets of infrequent itemsets, while Max-EGIs compactly represent all the infrequent descendants of a generalized itemset. Furthermore, we propose an algorithm to discover Max-EGIs on top of the traditionally mined itemsets. Experiments performed on both real and synthetic datasets demonstrate the effectiveness, efficiency, and scalability of the proposed approach.
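The idea of evaluating a high-level itemset only on the data not yet covered by its frequent descendants can be illustrated with a small sketch. This is not the authors' algorithm, and the formal EGI and Max-EGI definitions in the article may differ; it is a simplified reading of the abstract. The taxonomy, the transactions, and all function names (`leaves`, `matches`, `support`, `descendants`, `residual_support`) are hypothetical and chosen only for illustration.

```python
from itertools import product

# Hypothetical two-level taxonomy: leaf item -> higher-level concept.
TAXONOMY = {"Turin": "Italy", "Milan": "Italy", "Rome": "Italy", "Paris": "France"}

# Hypothetical toy dataset: each transaction is a set of leaf items.
TRANSACTIONS = [
    {"Turin", "Paris"},
    {"Turin"},
    {"Turin"},
    {"Milan"},
    {"Rome"},
    {"Paris"},
]


def leaves(item):
    """Leaf items aggregated by `item`; a leaf item maps to itself."""
    children = [leaf for leaf, parent in TAXONOMY.items() if parent == item]
    return children or [item]


def matches(itemset, transaction):
    """True when every item occurs directly or through one of its leaves."""
    return all(any(leaf in transaction for leaf in leaves(item)) for item in itemset)


def support(itemset, transactions):
    """Traditional (absolute) support of a generalized itemset."""
    return sum(matches(itemset, t) for t in transactions)


def descendants(itemset):
    """Itemsets obtained by replacing at least one generalized item with a leaf."""
    options = [
        [item] + [leaf for leaf in leaves(item) if leaf != item]
        for item in sorted(itemset)
    ]
    return {frozenset(combo) for combo in product(*options)} - {frozenset(itemset)}


def residual_support(itemset, transactions, min_sup):
    """Support counted only on transactions not covered by a frequent descendant.

    Assumption: "covered" means the transaction matches some descendant whose
    traditional support is at least `min_sup`.
    """
    frequent_desc = [
        d for d in descendants(itemset) if support(d, transactions) >= min_sup
    ]
    return sum(
        matches(itemset, t) and not any(matches(d, t) for d in frequent_desc)
        for t in transactions
    )


print(support({"Italy"}, TRANSACTIONS))              # 5
print(residual_support({"Italy"}, TRANSACTIONS, 2))  # 2
```

In this toy dataset `{Italy}` has traditional support 5, but three of those transactions are already explained by the frequent descendant `{Turin}`. The remaining support of 2 comes only from the infrequent descendants `{Milan}` and `{Rome}`, which is the kind of knowledge the abstract says EGIs are meant to summarize.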

Expressive generalized itemsets / Baralis, Elena Maria; Cagliero, Luca; Cerquitelli, Tania; D’Elia, V.; Garza, Paolo. - In: INFORMATION SCIENCES. - ISSN 0020-0255. - 278 (2014), pp. 327-343. [10.1016/j.ins.2014.03.056]


Publication year: 2014
DOI: https://dx.doi.org/10.1016/j.ins.2014.03.056
Journal title: INFORMATION SCIENCES
Publication type: 1.1 Journal article

Files in this record:

File	Access	Type	License	Size	Format
2543388_draft.pdf	Open access	1. Preprint / submitted version [pre-review]	Public - All rights reserved	477.54 kB	Adobe PDF


Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11583/2543388

Warning: the data shown have not been validated by the university.

PORTO @ Archivio Istituzionale della Ricerca
