SparseHash: Embedding Jaccard coefficient between supports of signals

Valsesia, Diego; Fosson, Sophie; Ravazzi, Chiara; Bianchi, Tiziano; Magli, Enrico

doi:10.1109/ICMEW.2016.7574678

Embeddings provide compact representations of signals to be used to perform inference in a wide variety of tasks. Random projections have been extensively used to preserve Euclidean distances or inner products of high dimensional signals into low dimensional representations. Different techniques based on hashing have been used in the past to embed set similarity metrics such as the Jaccard coefficient. In this paper we show that a class of random projections based on sparse matrices can be used to preserve the Jaccard coefficient between the supports of sparse signals. Our proposed construction can be therefore used in a variety of tasks in machine learning and multimedia signal processing where the overlap between signal supports is a relevant similarity metric. We also present an application in retrieval of similar text documents where SparseHash improves over MinHash.

SparseHash: Embedding Jaccard coefficient between supports of signals / Valsesia, Diego; Fosson, Sophie; Ravazzi, Chiara; Bianchi, Tiziano; Magli, Enrico. - ELETTRONICO. - (2016), pp. 1-6. (Intervento presentato al convegno 2016 IEEE International Conference on Multimedia & Expo - Workshop on Sparsity and Compressive Sensing in Multimedia (MM-SPARSE) tenutosi a Seattle , USA nel 11-15 July 2016) [10.1109/ICMEW.2016.7574678].

SparseHash: Embedding Jaccard coefficient between supports of signals

VALSESIA, DIEGO;FOSSON, SOPHIE;RAVAZZI, CHIARA;BIANCHI, TIZIANO;MAGLI, ENRICO

2016

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2016
			
	Codice ISBN
	
				978-1-5090-1552-8
978-1-5090-1552-8
			
	Appare nelle tipologie
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
valsesia_ICMEW16_OA.pdf accesso aperto Descrizione: versione autore Tipologia: 2. Post-print / Author's Accepted Manuscript Licenza: Pubblico - Tutti i diritti riservati Dimensione 266.91 kB Formato Adobe PDF Visualizza/Apri	266.91 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2651377

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

PORTO @ Archivio Istituzionale della Ricerca

SparseHash: Embedding Jaccard coefficient between supports of signals

VALSESIA, DIEGO;FOSSON, SOPHIE;RAVAZZI, CHIARA;BIANCHI, TIZIANO;MAGLI, ENRICO

2016

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Attenzione

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)