HW-Flow: A Multi-Abstraction Level HW-CNN Codesign Pruning Methodology

Manoj Rohit Vemparala,; Fasfous, Nael; Frickenstein, Alexander; Valpreda, Emanuele; Camalleri, Manfredi; Zhao, Qi; Unger, Christian; Naveen Shankar Nagaraja,; Martina, Maurizio; Stechele, Walter

doi:10.4230/LITES.8.1.3

Convolutional neural networks (CNNs) have produced unprecedented accuracy for many computer vision problems in the recent past. In power and compute-constrained embedded platforms, deploying modern CNNs can present many challenges. Most CNN architectures do not run in real-time due to the high number of computational operations involved during the inference phase. This emphasizes the role of CNN optimization techniques in early design space exploration. To estimate their efficacy in satisfying the target constraints, existing techniques are either hardware (HW) agnostic, pseudo-HW-aware by considering parameter and operation counts, or HW-aware through inflexible hardware-in-the-loop (HIL) setups. In this work, we introduce HW-Flow, a framework for optimizing and exploring CNN models based on three levels of hardware abstraction: Coarse, Mid and Fine. Through these levels, CNN design and optimization can be iteratively refined towards efficient execution on the target hardware platform. We present HW-Flow in the context of CNN pruning by augmenting a reinforcement learning agent with key metrics to understand the influence of its pruning actions on the inference hardware. With 2× reduction in energy and latency, we prune ResNet56, ResNet50, and DeepLabv3 with minimal accuracy degradation on the CIFAR-10, ImageNet, and CityScapes datasets, respectively.

HW-Flow: A Multi-Abstraction Level HW-CNN Codesign Pruning Methodology / Rohit Vemparala, Manoj; Fasfous, Nael; Frickenstein, Alexander; Valpreda, Emanuele; Camalleri, Manfredi; Zhao, Qi; Unger, Christian; Shankar Nagaraja, Naveen; Martina, Maurizio; Stechele, Walter. - In: LEIBNIZ TRANSACTIONS ON EMBEDDED SYSTEMS. - ISSN 2199-2002. - ELETTRONICO. - 8:1(2022), pp. 1-30. [10.4230/LITES.8.1.3]

HW-Flow: A Multi-Abstraction Level HW-CNN Codesign Pruning Methodology

Manoj Rohit Vemparala;Nael Fasfous;Alexander Frickenstein;Emanuele Valpreda;Manfredi Camalleri;Qi Zhao;Christian Unger;Naveen Shankar Nagaraja;Maurizio Martina;Walter Stechele

2022

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2022
			
	Codice DOI
	
				https://dx.doi.org/10.4230/LITES.8.1.3
			
	Titolo della Rivista
	
				LEIBNIZ TRANSACTIONS ON EMBEDDED SYSTEMS
			
	Appare nelle tipologie
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
hw-flow_published.pdf accesso aperto Descrizione: versione editore Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: Creative commons Dimensione 8.35 MB Formato Adobe PDF Visualizza/Apri	8.35 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2971412

PORTO @ Archivio Istituzionale della Ricerca

HW-Flow: A Multi-Abstraction Level HW-CNN Codesign Pruning Methodology

Manoj Rohit Vemparala;Nael Fasfous;Alexander Frickenstein;Emanuele Valpreda;Manfredi Camalleri;Qi Zhao;Christian Unger;Naveen Shankar Nagaraja;Maurizio Martina;Walter Stechele

2022

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)