Monocular Depth Perception on Microcontrollers for Edge Applications

Peluso, Valentino; Cipolletta, Antonio; Calimera, Andrea; Poggi, Matteo; Tosi, Fabio; Aleotti, Filippo; Mattoccia, Stefano

doi:10.1109/TCSVT.2021.3077395

Depth estimation is crucial in several computer vision applications, and a recent trend in this field aims at inferring such a cue from a single camera. Unfortunately, despite the compelling results achieved, state-of-the-art monocular depth estimation methods are computationally demanding, thus precluding their practical deployment in several application contexts characterized by low-power constraints. Therefore, in this paper, we propose a lightweight Convolutional Neural Network based on a shallow pyramidal architecture, referred to as μ PyD-Net, enabling monocular depth estimation on microcontrollers. The network is trained in a peculiar self-supervised manner leveraging proxy labels obtained through a traditional stereo algorithm. Moreover, we propose optimization strategies aimed at performing computations with quantized 8-bit data and map the high-level description of the network to low-level layers optimized for the target microcontroller architecture. Exhaustive experimental results on standard datasets and an in-depth evaluation with a device belonging to the popular Arm Cortex-M family confirm that obtaining sufficiently accurate monocular depth estimation on microcontrollers is feasible. To the best of our knowledge, our proposal is the first one enabling such remarkable achievement, paving the way for the deployment of monocular depth cues onto the tiny end-nodes of distributed sensor networks.

Monocular Depth Perception on Microcontrollers for Edge Applications / Peluso, Valentino; Cipolletta, Antonio; Calimera, Andrea; Poggi, Matteo; Tosi, Fabio; Aleotti, Filippo; Mattoccia, Stefano. - In: IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY. - ISSN 1051-8215. - 32:3(2022), pp. 1524-1536. [10.1109/TCSVT.2021.3077395]

Monocular Depth Perception on Microcontrollers for Edge Applications

Peluso, Valentino;Cipolletta, Antonio;Calimera, Andrea;Poggi, Matteo;Tosi, Fabio;Aleotti, Filippo;Mattoccia, Stefano

2022

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2022
			
	Codice DOI
	
				https://dx.doi.org/10.1109/TCSVT.2021.3077395
			
	Titolo della Rivista
	
				IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY
			
	Appare nelle tipologie
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
main_iris.pdf accesso aperto Tipologia: 2. Post-print / Author's Accepted Manuscript Licenza: Pubblico - Tutti i diritti riservati Dimensione 417.43 kB Formato Adobe PDF Visualizza/Apri	417.43 kB	Adobe PDF	Visualizza/Apri
Monocular_Depth_Perception_on_Microcontrollers_for_Edge_Applications.pdf accesso riservato Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: Non Pubblico - Accesso privato/ristretto Dimensione 3.56 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	3.56 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2903754

PORTO @ Archivio Istituzionale della Ricerca

Monocular Depth Perception on Microcontrollers for Edge Applications

Peluso, Valentino;Cipolletta, Antonio;Calimera, Andrea;Poggi, Matteo;Tosi, Fabio;Aleotti, Filippo;Mattoccia, Stefano

2022

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)