i-DarkVec: Incremental Embeddings for Darknet Traffic Analysis

Gioacchini, Luca; Vassio, Luca; Mellia, Marco; Drago, Idilio; Zied Ben Houidi,; Rossi, Dario

doi:10.1145/3595378

Darknets are probes listening to traffic reaching IP addresses that host no services. Traffic reaching a darknet results from the actions of internet scanners, botnets, and possibly misconfigured hosts. Such peculiar nature of the darknet traffic makes darknets a valuable instrument to discover malicious online activities, e.g., identifying coordinated actions performed by bots or scanners. However, the massive amount of packets and sources that darknets observe makes it hard to extract meaningful insights, calling for scalable tools to automatically identify and group sources that share similar behaviour. We here present i-DarkVec, a methodology to learn meaningful representations of Darknet traffic. i-DarkVec leverages Natural Language Processing techniques (e.g., Word2Vec) to capture the co-occurrence patterns that emerge when scanners or bots launch coordinated actions. As in NLP problems, the embeddings learned with i-DarkVec enable several new machine learning tasks on the darknet traffic, such as identifying clusters of senders engaged in similar activities. We extensively test i-DarkVec and explore its design space in a case study using real darknets. We show that with a proper definition of services, the learned embeddings can be used to (i) solve the classification problem to associate unknown sources’ IP addresses to the correct classes of coordinated actors and (ii) automatically identify clusters of previously unknown sources performing similar attacks and scans, easing the security analyst’s job. i-DarkVec leverages a novel incremental embedding learning approach that is scalable and robust to traffic changes, making it applicable to dynamic and large-scale scenarios.

i-DarkVec: Incremental Embeddings for Darknet Traffic Analysis / Gioacchini, Luca; Vassio, Luca; Mellia, Marco; Drago, Idilio; Ben Houidi, Zied; Rossi, Dario. - In: ACM TRANSACTIONS ON INTERNET TECHNOLOGY. - ISSN 1533-5399. - ELETTRONICO. - 23:3(2023), pp. 1-28. [10.1145/3595378]

i-DarkVec: Incremental Embeddings for Darknet Traffic Analysis

Luca Gioacchini;Luca Vassio;Marco Mellia;Idilio Drago;Zied Ben Houidi;Dario Rossi

2023

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2023
			
	Codice DOI
	
				https://dx.doi.org/10.1145/3595378
			
	Titolo della Rivista
	
				ACM TRANSACTIONS ON INTERNET TECHNOLOGY
			
	Appare nelle tipologie
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
idarkvec_toit.pdf accesso aperto Descrizione: i-DarkVec: Incremental Embeddings for Darknet Traffic Analysis Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: PUBBLICO - Tutti i diritti riservati Dimensione 6.39 MB Formato Adobe PDF Visualizza/Apri	6.39 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2981177

PORTO @ Archivio Istituzionale della Ricerca

i-DarkVec: Incremental Embeddings for Darknet Traffic Analysis

Luca Gioacchini;Luca Vassio;Marco Mellia;Idilio Drago;Zied Ben Houidi;Dario Rossi

2023

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)