Emerging mobile applications often require the execution of computer vision (CV) tasks based on compute- and memory-intensive deep neural networks (DNNs). Although offloading CV tasks to edge servers can decrease resource consumption at the mobile devices, it poses the challenge of handling multiple concurrent tasks with limited computing and memory capacity. In stark opposition with the existing state of the art, we tackle this challenge by jointly optimizing (i) the utilization of resources at the edge, among which memory – so- far widely overlooked – and the radio resources used for task offloading; (ii) which and how many offloaded tasks should be executed; and (iii) the structure of the DNNs. First, we formulate the DNN for scalable Offloading of Tasks (DOT) problem, prove that it is NP-hard, and envision a weighted-tree-based heuristic solution, named OffloaDNN, that efficiently solves the DOT problem. We evaluate OffloaDNN through extensive numerical analysis using state-of-the-art image classification ResNet-18, as well as real-world experiments on the Colosseum emulator. The numerical results show that, in small-scale scenarios, OffloaDNN matches the optimum very closely, and, in larger-scale scenarios, increases the number of admitted offloaded tasks by 26.9% with respect to the state of the art, while saving 82.5% memory and 77.4% per-inference computing time. The numerical results are confirmed by the real-world validation on Colosseum.

OffloaDNN: Shaping DNNs for Scalable Offloading of Computer Vision Tasks at the Edge / Puligheddu, Corrado; Varshney, Nancy; Hassan, Tanzil; Ashdown, Jonathan; Restuccia, Francesco; Chiasserini, Carla Fabiana. - ELETTRONICO. - (2024). (Intervento presentato al convegno IEEE ICDCS 2024 tenutosi a Jersey City (USA) nel July 2024).

OffloaDNN: Shaping DNNs for Scalable Offloading of Computer Vision Tasks at the Edge

Corrado Puligheddu;Nancy Varshney;Carla Fabiana Chiasserini
2024

Abstract

Emerging mobile applications often require the execution of computer vision (CV) tasks based on compute- and memory-intensive deep neural networks (DNNs). Although offloading CV tasks to edge servers can decrease resource consumption at the mobile devices, it poses the challenge of handling multiple concurrent tasks with limited computing and memory capacity. In stark opposition with the existing state of the art, we tackle this challenge by jointly optimizing (i) the utilization of resources at the edge, among which memory – so- far widely overlooked – and the radio resources used for task offloading; (ii) which and how many offloaded tasks should be executed; and (iii) the structure of the DNNs. First, we formulate the DNN for scalable Offloading of Tasks (DOT) problem, prove that it is NP-hard, and envision a weighted-tree-based heuristic solution, named OffloaDNN, that efficiently solves the DOT problem. We evaluate OffloaDNN through extensive numerical analysis using state-of-the-art image classification ResNet-18, as well as real-world experiments on the Colosseum emulator. The numerical results show that, in small-scale scenarios, OffloaDNN matches the optimum very closely, and, in larger-scale scenarios, increases the number of admitted offloaded tasks by 26.9% with respect to the state of the art, while saving 82.5% memory and 77.4% per-inference computing time. The numerical results are confirmed by the real-world validation on Colosseum.
File in questo prodotto:
File Dimensione Formato  
Edge_Offloading___ICDCS_2024.pdf

accesso aperto

Tipologia: 1. Preprint / submitted version [pre- review]
Licenza: PUBBLICO - Tutti i diritti riservati
Dimensione 631.5 kB
Formato Adobe PDF
631.5 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2987790