On-Hardware Resilience Analysis of DPU-Accelerated CNNs on FPGA-Based Systems

Buccellato, Federico; De Sio, Corrado; Azimi, Sarah; Sterpone, Luca

doi:10.1109/DSD67783.2025.00017

In recent years, Reconfigurable SoCs have emerged as a high-performance solution for embedded systems, addressing the increasing complexity of neural networks, balancing performance, cost, and adaptability. Flexible hardware accelerators, such as AMD’s Deep Learning Processing Units (DPUs), enable efficient computation across various domains, including safety-critical applications. However, soft errors remain a significant reliability concern, especially in harsh environments like space, where radiationinduced corruption of configuration memory poses a significant threat to FPGA-based systems. Most research on the reliability and robustness of deep learning models against soft errors has focused on application-level analyses, with comparatively little attention paid to architectural hardware faults. This paper introduces a resilience evaluation framework targeting AMD’s state-of-the-art DPU, comparing traditional application-level fault injection with hardware-aware fault injection performed on an actual hardware platform, a Kria KV260. Applying this methodology, we evaluated fourteen different deep neural network architectures and demonstrated that hardware-aware fault injections reveal critical vulnerabilities that applicationonly approaches fail to detect. Moreover, we investigated the source of different faults at the hardware level, enabling the identification of architectural resources that are more susceptible to errors. These insights are valuable to support the development of more robust deployment strategies and mitigation techniques tailored to FPGA-based deep learning accelerators.

On-Hardware Resilience Analysis of DPU-Accelerated CNNs on FPGA-Based Systems / Buccellato, Federico; De Sio, Corrado; Azimi, Sarah; Sterpone, Luca. - ELETTRONICO. - (2025), pp. 34-41. ( 28th Euromicro Conference Series on Digital System Design Salerno (ITA) September 10th-12th, 2025) [10.1109/DSD67783.2025.00017].

On-Hardware Resilience Analysis of DPU-Accelerated CNNs on FPGA-Based Systems

Federico Buccellato;Corrado De Sio;Sarah Azimi;Luca Sterpone

2025

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2025
			
	Codice ISBN
	
				979-8-3315-8499-3
			
	Appare nelle tipologie
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
DSD_2025_CameraReady.pdf accesso aperto Tipologia: 2. Post-print / Author's Accepted Manuscript Licenza: Pubblico - Tutti i diritti riservati Dimensione 1.02 MB Formato Adobe PDF Visualizza/Apri	1.02 MB	Adobe PDF	Visualizza/Apri
On-Hardware_Resilience_Analysis_of_DPUAccelerated_CNNs_on_FPGA-Based_Systems.pdf accesso riservato Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: Non Pubblico - Accesso privato/ristretto Dimensione 561.8 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	561.8 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/3002680

PORTO @ Archivio Istituzionale della Ricerca

On-Hardware Resilience Analysis of DPU-Accelerated CNNs on FPGA-Based Systems

Federico Buccellato;Corrado De Sio;Sarah Azimi;Luca Sterpone

2025

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)