Robust URL Classification With Generative Adversarial Networks

Trevisan, Martino; Drago, Idilio

doi:10.1145/3308897.3308959

Classifying URLs is essential for different applications, such as parental control, URL filtering and Ads/tracking protection. Such systems historically identify URLs by means of regular expressions, even if machine learning alternatives have been proposed to overcome the time-consuming maintenance of classification rules. Classical machine learning algorithms, however, require large samples of URLs to train the models, covering the diverse classes of URLs (i.e., a ground truth), which somehow limits the applicability of the approach. We here give a first step towards the use of Generative Adversarial Neural Networks (GANs) to classify URLs. GANs are attractive for this problem for two reasons. First, GANs can produce samples of URLs belonging to specific classes even if exposed to a limited training set, outputting both synthetic traces and a robust discriminator. Second, a GAN can be trained to discriminate a class of URLs without being exposed to all other URLs classes – i.e., GANs are robust even if not exposed to uninteresting URL classes during training. Experiments on real data show that not only the generated synthetic traces are somehow realistic, but also the URL classification is accurate with GANs. © is is held held by by author/owner(s). author/owner(s).

Robust URL Classification With Generative Adversarial Networks / Trevisan, M., Drago, I.. - In: PERFORMANCE EVALUATION REVIEW. - ISSN 0163-5999. - ELETTRONICO. - 46:3(2019), pp. 143-146. [10.1145/3308897.3308959]

Robust URL Classification With Generative Adversarial Networks

Martino Trevisan;Idilio Drago

2019

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2019
			
	Codice DOI
	
				https://dx.doi.org/10.1145/3308897.3308959
			
	Titolo della Rivista
	
				PERFORMANCE EVALUATION REVIEW
			
	Appare nelle tipologie
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
url_gan.pdf accesso aperto Descrizione: Pre-Print, pubblica Tipologia: 2. Post-print / Author's Accepted Manuscript Licenza: Pubblico - Tutti i diritti riservati Dimensione 414.98 kB Formato Adobe PDF Visualizza/Apri	414.98 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2723875

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

PORTO @ Archivio Istituzionale della Ricerca

Robust URL Classification With Generative Adversarial Networks

Martino Trevisan;Idilio Drago

2019

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Attenzione

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)