Unsupervised Detection of Web Trackers

Metwalley, Hassan; Traverso, Stefano; Mellia, Marco

doi:10.1109/GLOCOM.2015.7417499

When browsing, users are consistently tracked by parties whose business builds on the value of collected data. The privacy implications are serious. Consumers and companies worry about the information they unknowingly expose to the outside world, and they call for mechanisms to curb this leakage. Existing countermeasures to web tracking rely on hostname blacklists whose origin is impossible to know and which must be continuously updated. This paper presents a novel, unsupervised methodology that leverages application-level traffic logs to automatically detect services that perform tracking, thus enabling the generation of curated blacklists. The methodology builds on an algorithm that pinpoints pieces of information containing user identifiers exposed in URL queries in HTTP(S) transactions. We validate our algorithm on an artificial dataset obtained by visiting the 200 most popular websites in the Alexa ranking. Results are excellent: the algorithm identifies 34 new third-party trackers not present in available blacklists. By analyzing the output of our algorithm, some privacy-related interactions emerge. For instance, we observe scenarios clearly hinting at the Cookie Matching practice, in which information about users’ activity is shared across several different third parties.
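The abstract does not spell out how user identifiers are pinpointed in URL queries, so the following is only a minimal sketch of one plausible heuristic, not the authors' algorithm. It assumes a traffic log of (user, URL) records and flags a (hostname, query key) pair when each user always sends the same value and no two users share a value. The function name `find_identifier_keys`, the `min_users` threshold, and the sample hostnames are all hypothetical.

```python
from collections import defaultdict
from urllib.parse import urlsplit, parse_qsl


def find_identifier_keys(log, min_users=2):
    """Return sorted (hostname, key) pairs whose values look like user identifiers.

    `log` is an iterable of (user, url) records (an assumed log format).
    A pair is flagged when every user always sends the same value for it
    and no two users share a value.
    """
    # (hostname, key) -> user -> set of values observed for that user
    seen = defaultdict(lambda: defaultdict(set))
    for user, url in log:
        parts = urlsplit(url)
        host = parts.hostname or ""
        for key, value in parse_qsl(parts.query):
            seen[(host, key)][user].add(value)

    flagged = []
    for pair, per_user in seen.items():
        if len(per_user) < min_users:
            continue  # too few users to judge uniqueness
        # Stable: each user is always associated with exactly one value.
        stable = all(len(values) == 1 for values in per_user.values())
        # Distinct: no value is shared by two different users.
        one_value_each = [next(iter(values)) for values in per_user.values()]
        distinct = len(set(one_value_each)) == len(per_user)
        if stable and distinct:
            flagged.append(pair)
    return sorted(flagged)


# Hypothetical sample log: `uid` behaves like an identifier, `page` and `v` do not.
log = [
    ("alice", "http://tracker.example/pixel?uid=a1b2&page=home"),
    ("alice", "http://tracker.example/pixel?uid=a1b2&page=news"),
    ("bob", "http://tracker.example/pixel?uid=c3d4&page=home"),
    ("bob", "http://cdn.example/lib.js?v=3"),
    ("alice", "http://cdn.example/lib.js?v=3"),
]
print(find_identifier_keys(log))
```

In this sketch, hostnames that expose at least one flagged key would be the candidates for a blacklist. A real deployment would likely need more users and repeated visits before trusting the result.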

Unsupervised Detection of Web Trackers / Metwalley, Hassan; Traverso, Stefano; Mellia, Marco. - ELECTRONIC. - (2015), pp. 1-6. (IEEE Globecom 2015, San Diego, CA, December 2015) [10.1109/GLOCOM.2015.7417499].

Year of publication: 2015
ISBN: 978-1-4799-5952-5
Publication type: 4.1 Contribution in conference proceedings

Files in this record:

File	Size	Format
TrackingGlobecom2015.pdf (open access; description: camera ready; type: 2. Post-print / Author's Accepted Manuscript; license: Public - All rights reserved)	454.98 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11583/2641567

Note: the data shown have not been validated by the university.

PORTO @ Institutional Research Archive
