A Multiply-And-Max/Min Neuron Paradigm for Aggressively Prunable Deep Neural Networks / Prono, Luciano; Bich, Philippe; Boretti, Chiara; Mangia, Mauro; Pareschi, Fabio; Rovatti, Riccardo; Setti, Gianluca. - In: IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS. - ISSN 2162-237X. - PRINT. - (2025). [DOI: 10.1109/tnnls.2025.3527644]

A Multiply-And-Max/Min Neuron Paradigm for Aggressively Prunable Deep Neural Networks

Prono, Luciano; Bich, Philippe; Boretti, Chiara; Pareschi, Fabio; Setti, Gianluca
2025

Abstract

The growing interest in the Internet of Things (IoT) and mobile artificial intelligence applications is driving the investigation of deep neural networks (DNNs) that can operate at the edge on low-resource, low-energy devices. To achieve this goal, several pruning techniques have been proposed in the literature. They aim to reduce the number of interconnections, and consequently the size and the corresponding computing and storage requirements, of DNNs that traditionally rely on classic multiply-and-accumulate (MAC) neurons. In this work, we propose a novel neuron structure based on a multiply-and-max/min (MAM) map-reduce paradigm, and we show that this paradigm makes it possible to build naturally and aggressively prunable DNN layers with a negligible loss in performance. The proposed structure allows much greater interconnection sparsity than classic MAC-based DNN layers. Moreover, most existing state-of-the-art pruning techniques can be applied to MAM layers with little to no modification. To test the pruning performance of MAM, we employ different models (AlexNet, VGG-16, and the more recent ViT-B/16) and different computer vision datasets (CIFAR-10, CIFAR-100, and ImageNet-1K). Multiple pruning approaches are applied, ranging from single-shot methods to training-dependent and iterative techniques. As a notable example, we test MAM on the ViT-B/16 model fine-tuned on ImageNet-1K and apply one-shot gradient-based pruning, removing interconnections until the model loses 6% accuracy. While the selected MAC-based layers need at least 38.2% of their interconnections to remain, MAM-based layers achieve the same accuracy with only 0.1%.
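To make the map-reduce idea concrete, the following is a minimal sketch, not the authors' implementation: it assumes that a MAM neuron replaces the accumulation of the input-weight products with the sum of their maximum and minimum, as the name "multiply-and-max/min" suggests, while the MAC neuron keeps the usual summation.

    # Hypothetical illustration of MAC vs. MAM neurons (assumed reduce step).
    import numpy as np

    rng = np.random.default_rng(0)
    x = rng.standard_normal(64)        # input activations of a single neuron
    w = rng.standard_normal(64)        # weights (one per interconnection)

    prod = w * x                       # "multiply" (map) step, common to both

    y_mac = prod.sum()                 # multiply-and-accumulate: sum of all products
    y_mam = prod.max() + prod.min()    # multiply-and-max/min: only the two extreme products

    print(f"MAC output: {y_mac:+.3f}")
    print(f"MAM output: {y_mam:+.3f}")

Under this assumption, only the interconnections that produce the extreme products influence each MAM output, which gives an intuition for why such layers can tolerate far more aggressive pruning than MAC layers, as claimed in the abstract.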
Files in this record:
TNNLS-MAM.pdf (open access)
Description: author's version
Type: 2. Post-print / Author's Accepted Manuscript
License: Public - All rights reserved
Size: 486.64 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11583/2998484