Systolic array has emerged as a prominent architecture for Deep Neural Network (DNN) hardware accelerators, providing high-throughput and low-latency performance essential for deploying DNNs across diverse applications. However, when used in safety-critical applications, reliability assessment is mandatory to guarantee the correct behavior of DNN accelerators. While fault injection stands out as a well-established practical and robust method for reliability assessment, it is still a very time-consuming process. This paper addresses the time efficiency issue by introducing a novel hierarchical software-based hardware-aware fault injection strategy tailored for systolic array-based DNN accelerators. A uniform Recurrent Equations system is used for software modeling of the systolic-array core of the DNN accelerators. The approach demonstrates a reduction of the fault injection time up to 3x compared to the state-of-the-art hybrid (software/hardware) hardware-aware fault injection frameworks and more than 2000x compared to RT-level fault injection frameworks, without compromising the accuracy from the application level. Additionally, we introduce novel reliability metrics to better evaluate the robustness of a deep neural network system. The performance of the framework is studied on state-of-the-art DNN benchmarks.

SAFFIRA A Framework for Assessing the Reliability of Systolic-Array DNN Accelerators / Pappalardo, Salvatore; Bellarmino, Nicolo'; Deveautour, Bastien; Bosio, Alberto; Taheri, Mahdi; Daneshtalab, Masoud; Raik, Jaan; Jenihhin, Maksim. - In: JOURNAL OF CIRCUITS, SYSTEMS, AND COMPUTERS. - ISSN 0218-1266. - (2025). [10.1142/s0218126625430017]

SAFFIRA A Framework for Assessing the Reliability of Systolic-Array DNN Accelerators

Bellarmino, Nicolo';
2025

Abstract

Systolic array has emerged as a prominent architecture for Deep Neural Network (DNN) hardware accelerators, providing high-throughput and low-latency performance essential for deploying DNNs across diverse applications. However, when used in safety-critical applications, reliability assessment is mandatory to guarantee the correct behavior of DNN accelerators. While fault injection stands out as a well-established practical and robust method for reliability assessment, it is still a very time-consuming process. This paper addresses the time efficiency issue by introducing a novel hierarchical software-based hardware-aware fault injection strategy tailored for systolic array-based DNN accelerators. A uniform Recurrent Equations system is used for software modeling of the systolic-array core of the DNN accelerators. The approach demonstrates a reduction of the fault injection time up to 3x compared to the state-of-the-art hybrid (software/hardware) hardware-aware fault injection frameworks and more than 2000x compared to RT-level fault injection frameworks, without compromising the accuracy from the application level. Additionally, we introduce novel reliability metrics to better evaluate the robustness of a deep neural network system. The performance of the framework is studied on state-of-the-art DNN benchmarks.
File in questo prodotto:
File Dimensione Formato  
Saffira_Extended.pdf

accesso riservato

Tipologia: 2. Post-print / Author's Accepted Manuscript
Licenza: Non Pubblico - Accesso privato/ristretto
Dimensione 7.73 MB
Formato Adobe PDF
7.73 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/3002078