Neural Networks (NNs) are increasingly adopted in many domains. Given their considerable computational expenses required only during inference, it has been proposed that their architecture can be split into two sections (Head and Tail) in a paradigm referred to as Split Computing (SC). The Head section executes the first part of a NN at a mobile device, while a Tail model calculates the missing part of the NN resorting to a server. SC allows a designer to find the optimal configuration, trading off computing load, performance, latency, and power. However, the adoption of SC approaches in safety-critical domains involves strong reliability concerns. This paper introduces an environment allowing to estimate the effects of permanent faults affecting the hardware supporting the execution of the head model, which is the most prone to faults. Results are reported for different NNs and different configurations, demonstrating the effectiveness of the method and paving the way towards the deployment of more resilient SC systems. Our results show that the considered split configurations of all models under test are up to 10% more critically vulnerable to faults than the corresponding original architecture.
Assessing the Reliability of Different Split Computing Neural Network Applications / Esposito, Giuseppe; Guerrero-Balaguera, Juan-David; Rodriguez Condia, Josie E.; Levorato, Marco; Reorda, Matteo Sonza. - (2024). (Intervento presentato al convegno 2024 IEEE 25th Latin American Test Symposium (LATS) tenutosi a Maceio (BRA) nel 09-12 April 2024) [10.1109/lats62223.2024.10534618].
Assessing the Reliability of Different Split Computing Neural Network Applications
Esposito, Giuseppe;Guerrero-Balaguera, Juan-David;Rodriguez Condia, Josie E.;Levorato, Marco;Reorda, Matteo Sonza
2024
Abstract
Neural Networks (NNs) are increasingly adopted in many domains. Given their considerable computational expenses required only during inference, it has been proposed that their architecture can be split into two sections (Head and Tail) in a paradigm referred to as Split Computing (SC). The Head section executes the first part of a NN at a mobile device, while a Tail model calculates the missing part of the NN resorting to a server. SC allows a designer to find the optimal configuration, trading off computing load, performance, latency, and power. However, the adoption of SC approaches in safety-critical domains involves strong reliability concerns. This paper introduces an environment allowing to estimate the effects of permanent faults affecting the hardware supporting the execution of the head model, which is the most prone to faults. Results are reported for different NNs and different configurations, demonstrating the effectiveness of the method and paving the way towards the deployment of more resilient SC systems. Our results show that the considered split configurations of all models under test are up to 10% more critically vulnerable to faults than the corresponding original architecture.File | Dimensione | Formato | |
---|---|---|---|
Assessing_the_Reliability_of_Different_Split_Computing_Neural_Network_Applications.pdf
accesso riservato
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Non Pubblico - Accesso privato/ristretto
Dimensione
1.01 MB
Formato
Adobe PDF
|
1.01 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2990941