Neural Networks (NNs) are increasingly adopted in many domains. Given their considerable computational expenses required only during inference, it has been proposed that their architecture can be split into two sections (Head and Tail) in a paradigm referred to as Split Computing (SC). The Head section executes the first part of a NN at a mobile device, while a Tail model calculates the missing part of the NN resorting to a server. SC allows a designer to find the optimal configuration, trading off computing load, performance, latency, and power. However, the adoption of SC approaches in safety-critical domains involves strong reliability concerns. This paper introduces an environment allowing to estimate the effects of permanent faults affecting the hardware supporting the execution of the head model, which is the most prone to faults. Results are reported for different NNs and different configurations, demonstrating the effectiveness of the method and paving the way towards the deployment of more resilient SC systems. Our results show that the considered split configurations of all models under test are up to 10% more critically vulnerable to faults than the corresponding original architecture.

Assessing the Reliability of Different Split Computing Neural Network Applications / Esposito, Giuseppe; Guerrero-Balaguera, Juan-David; Rodriguez Condia, Josie E.; Levorato, Marco; Reorda, Matteo Sonza. - (2024). (Intervento presentato al convegno 2024 IEEE 25th Latin American Test Symposium (LATS) tenutosi a Maceio (BRA) nel 09-12 April 2024) [10.1109/lats62223.2024.10534618].

Assessing the Reliability of Different Split Computing Neural Network Applications

Esposito, Giuseppe;Guerrero-Balaguera, Juan-David;Rodriguez Condia, Josie E.;Levorato, Marco;Reorda, Matteo Sonza
2024

Abstract

Neural Networks (NNs) are increasingly adopted in many domains. Given their considerable computational expenses required only during inference, it has been proposed that their architecture can be split into two sections (Head and Tail) in a paradigm referred to as Split Computing (SC). The Head section executes the first part of a NN at a mobile device, while a Tail model calculates the missing part of the NN resorting to a server. SC allows a designer to find the optimal configuration, trading off computing load, performance, latency, and power. However, the adoption of SC approaches in safety-critical domains involves strong reliability concerns. This paper introduces an environment allowing to estimate the effects of permanent faults affecting the hardware supporting the execution of the head model, which is the most prone to faults. Results are reported for different NNs and different configurations, demonstrating the effectiveness of the method and paving the way towards the deployment of more resilient SC systems. Our results show that the considered split configurations of all models under test are up to 10% more critically vulnerable to faults than the corresponding original architecture.
2024
979-8-3503-6555-9
File in questo prodotto:
File Dimensione Formato  
Assessing_the_Reliability_of_Different_Split_Computing_Neural_Network_Applications.pdf

non disponibili

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Non Pubblico - Accesso privato/ristretto
Dimensione 1.01 MB
Formato Adobe PDF
1.01 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2990941