Miniaturized autonomous unmanned aerial vehicles (UAVs) are gaining popularity due to their small size, enabling new tasks such as indoor navigation or people monitoring. Nonetheless, their size and simple electronics pose severe challenges in implementing advanced onboard intelligence. This work proposes a new automatic optimization pipeline for visual pose estimation tasks using Deep Neural Networks (DNNs). The pipeline leverages two different Neural Architecture Search (NAS) algorithms to pursue a vast complexity-driven exploration in the DNNs’ architectural space. The obtained networks are then deployed on an off-the-shelf nano-drone equipped with a parallel ultra-low power System-on-Chip leveraging a set of novel software kernels for the efficient fused execution of critical DNN layer sequences. Our results improve the state-of-the-art reducing inference latency by up to 3.22x at iso-error.
Optimized Deployment of Deep Neural Networks for Visual Pose Estimation on Nano-drones / Risso, Matteo; Daghero, Francesco; Motetti, BEATRICE ALESSANDRA; JAHIER PAGLIARI, Daniele; Macii, Enrico; Poncino, Massimo; Burrello, Alessio. - ELETTRONICO. - (In corso di stampa). (Intervento presentato al convegno ERF 2024 | European Robotics Forum tenutosi a Rimini (IT) nel March, 13 - 15, 2024).
Optimized Deployment of Deep Neural Networks for Visual Pose Estimation on Nano-drones
Matteo Risso;Francesco Daghero;Beatrice Alessandra Motetti;Daniele Jahier Pagliari;Enrico Macii;Massimo Poncino;Alessio Burrello
In corso di stampa
Abstract
Miniaturized autonomous unmanned aerial vehicles (UAVs) are gaining popularity due to their small size, enabling new tasks such as indoor navigation or people monitoring. Nonetheless, their size and simple electronics pose severe challenges in implementing advanced onboard intelligence. This work proposes a new automatic optimization pipeline for visual pose estimation tasks using Deep Neural Networks (DNNs). The pipeline leverages two different Neural Architecture Search (NAS) algorithms to pursue a vast complexity-driven exploration in the DNNs’ architectural space. The obtained networks are then deployed on an off-the-shelf nano-drone equipped with a parallel ultra-low power System-on-Chip leveraging a set of novel software kernels for the efficient fused execution of critical DNN layer sequences. Our results improve the state-of-the-art reducing inference latency by up to 3.22x at iso-error.File | Dimensione | Formato | |
---|---|---|---|
2402.15273v1.pdf
accesso aperto
Tipologia:
1. Preprint / submitted version [pre- review]
Licenza:
PUBBLICO - Tutti i diritti riservati
Dimensione
815.57 kB
Formato
Adobe PDF
|
815.57 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2992684