CrimeNet: Neural Structured Learning using Vision Transformer for violence detection

Rendón-Segador, Fernando J.; Álvarez-García, Juan A.; Salazar González, Jose L.; Tommasi, Tatiana

doi:10.1016/j.neunet.2023.01.048

The state of the art in violence detection in videos has improved in recent years thanks to deep learning models, but it is still below 90% of average precision in the most complex datasets, which may pose a problem of frequent false alarms in video surveillance environments and may cause security guards to disable the artificial intelligence system. In this study, we propose a new neural network based on Vision Transformer (ViT) and Neural Structured Learning (NSL) with adversarial training. This network, called CrimeNet, outperforms previous works by a large margin and reduces practically to zero the false positives. Our tests on the four most challenging violence-related datasets (binary and multi-class) show the effectiveness of CrimeNet, improving the state of the art from 9.4 to 22.17 percentage points in ROC AUC depending on the dataset. In addition, we present a generalisation study on our model by training and testing it on different datasets. The obtained results show that CrimeNet improves over competing methods with a gain of between 12.39 and 25.22 percentage points, showing remarkable robustness.

CrimeNet: Neural Structured Learning using Vision Transformer for violence detection / Rendón-Segador, Fernando J.; Álvarez-García, Juan A.; Salazar González, Jose L.; Tommasi, Tatiana. - In: NEURAL NETWORKS. - ISSN 0893-6080. - ELETTRONICO. - 161:(2023), pp. 318-329. [10.1016/j.neunet.2023.01.048]

CrimeNet: Neural Structured Learning using Vision Transformer for violence detection

Fernando J. Rendón-Segador;Juan A. Álvarez-García;Jose L. Salazar González;Tatiana Tommasi

2023

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2023
			
	Codice DOI
	
				https://dx.doi.org/10.1016/j.neunet.2023.01.048
			
	Titolo della Rivista
	
				NEURAL NETWORKS
			
	Appare nelle tipologie
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
1-s2.0-S0893608023000606-main.pdf accesso aperto Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: Creative commons Dimensione 1.96 MB Formato Adobe PDF Visualizza/Apri	1.96 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2975827

PORTO @ Archivio Istituzionale della Ricerca

CrimeNet: Neural Structured Learning using Vision Transformer for violence detection

Fernando J. Rendón-Segador;Juan A. Álvarez-García;Jose L. Salazar González;Tatiana Tommasi

2023

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)