LogPrécis: Unleashing language models for automated malicious log analysis

Boffa, Matteo; Drago, Idilio; Mellia, Marco; Vassio, Luca; Giordano, Danilo; Valentim, Rodolfo; Houidi, Zied Ben

doi:10.1016/j.cose.2024.103805

Security logs are the key to understanding attacks and diagnosing vulnerabilities. Often coming in the form of text logs, their analysis remains a daunting challenge. Language Models (LMs) have demonstrated unmatched potential in understanding natural and programming languages. The question arises as to whether and how LMs could be also used to automatise the analysis of security logs. We here systematically study how to benefit from the state-of-the-art LM to support the analysis of text-like Unix shell attack logs automatically. For this, we thoroughly designed LogPrécis. LogPrécis receives as input malicious shell sessions. It then automatically identifies and assigns the attacker tactic to each portion of the session, i.e., unveiling the sequence of the attacker's goals. This creates a unique attack fingerprint. We demonstrate LogPrécis capability to support the analysis of two large datasets containing about 400,000 unique Unix shell attacks recorded in a 2-year-long honeypot deployment. LogPrécis reduces the analysis to about 3,000 unique fingerprints. Such abstraction lets us better understand attacks, extract attack prototypes, detect novelties, and track families and mutations. Overall, LogPrécis, released as open source, demonstrates the potential of adopting LMs for security analysis and paves the way for better and more responsive defence against cyberattacks.

LogPrécis: Unleashing language models for automated malicious log analysis / Boffa, Matteo; Drago, Idilio; Mellia, Marco; Vassio, Luca; Giordano, Danilo; Valentim, Rodolfo; Houidi, Zied Ben. - In: COMPUTERS & SECURITY. - ISSN 0167-4048. - ELETTRONICO. - 141:(2024). [10.1016/j.cose.2024.103805]

LogPrécis: Unleashing language models for automated malicious log analysis

Boffa, Matteo;Drago, Idilio;Mellia, Marco;Vassio, Luca;Giordano, Danilo;Valentim, Rodolfo;Houidi, Zied Ben

2024

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2024
			
	Codice DOI
	
				https://dx.doi.org/10.1016/j.cose.2024.103805
			
	Titolo della Rivista
	
				COMPUTERS & SECURITY
			
	Appare nelle tipologie
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
1-s2.0-S0167404824001068-main.pdf accesso aperto Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: Creative commons Dimensione 2.69 MB Formato Adobe PDF Visualizza/Apri	2.69 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2987742

PORTO @ Archivio Istituzionale della Ricerca

LogPrécis: Unleashing language models for automated malicious log analysis

Boffa, Matteo;Drago, Idilio;Mellia, Marco;Vassio, Luca;Giordano, Danilo;Valentim, Rodolfo;Houidi, Zied Ben

2024

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)