
TinyCL: An Efficient Hardware Architecture for Continual Learning on Autonomous Systems / Ressa, Eugenio; Marchisio, Alberto; Martina, Maurizio; Masera, Guido; Shafique, Muhammad. - Electronic. - (2025), pp. 1-7. (2025 International Joint Conference on Neural Networks (IJCNN), Rome, Italy, 30 June - 5 July 2025) [10.1109/ijcnn64981.2025.11228478].

TinyCL: An Efficient Hardware Architecture for Continual Learning on Autonomous Systems

Martina, Maurizio; Masera, Guido
2025

Abstract

The Continual Learning (CL) paradigm consists of continuously evolving the parameters of a Deep Neural Network (DNN) model so that it progressively learns to perform new tasks without reducing its performance on previous tasks, i.e., avoiding the so-called catastrophic forgetting. However, the DNN parameter updates in CL-based autonomous systems are extremely resource-hungry. Existing DNN accelerators cannot be directly employed for CL because they only support the execution of the forward propagation, and the few prior architectures that execute backpropagation and weight updates lack the control and management logic required for CL. To this end, we design TinyCL, a hardware architecture that performs CL on resource-constrained autonomous systems. It consists of a processing unit that executes both forward and backward propagation and a control unit that manages the memory-based CL workload. To minimize memory accesses, the sliding window of the convolutional layer moves in a snake-like fashion. Moreover, the Multiply-and-Accumulate (MAC) units can be reconfigured at runtime to execute different operations. To the best of our knowledge, TinyCL is the first hardware accelerator that executes CL on autonomous systems. We synthesize the complete TinyCL architecture in a 65 nm CMOS technology node using a conventional ASIC design flow. It executes one training epoch of a Conv + ReLU + Dense model on the CIFAR10 dataset in 1.76 s, whereas one training epoch of the same model on an Nvidia Tesla P100 GPU takes 103 s, thus achieving a 58× speedup, while consuming 86 mW on a 4.74 mm² die.
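To make the two dataflow ideas in the abstract concrete, here is a minimal sketch in C of a snake-like sliding-window traversal. This is our own illustration under assumed feature-map and kernel sizes, not the TinyCL RTL: the scan reverses direction on every row, so consecutive windows always overlap and only one new column of activations must be fetched from memory per step.

    #include <stdio.h>

    /* Illustrative sketch (not the TinyCL design): snake-like traversal
     * of a convolutional sliding window. Reversing the scan direction on
     * every row keeps consecutive windows overlapping by K-1 columns, so
     * only one new column of activations is read per step. */

    #define H 8  /* input feature-map height (assumed for illustration) */
    #define W 8  /* input feature-map width  (assumed for illustration) */
    #define K 3  /* kernel size              (assumed for illustration) */

    int main(void) {
        /* Print the (row, col) of the window's top-left corner in scan order. */
        for (int r = 0; r + K <= H; r++) {
            if (r % 2 == 0) {
                for (int c = 0; c + K <= W; c++)   /* even rows: left to right */
                    printf("(%d,%d) ", r, c);
            } else {
                for (int c = W - K; c >= 0; c--)   /* odd rows: right to left */
                    printf("(%d,%d) ", r, c);
            }
            printf("\n");
        }
        return 0;
    }

Likewise, a runtime-reconfigurable MAC can be sketched as a shared multiplier-adder pair whose operand routing depends on a mode signal, so the same unit serves the forward pass and both backward-pass products. The mode names and operand widths below are our assumptions, not taken from the paper:

    #include <stdint.h>

    /* Illustrative sketch (mode names and widths are assumptions): a MAC
     * unit whose operand routing is selected at runtime, so one shared
     * multiplier and adder cover forward, weight-gradient, and
     * input-gradient computation. */

    typedef enum {
        MAC_FWD,      /* forward:          acc += activation * weight */
        MAC_GRAD_W,   /* weight gradient:  acc += activation * error  */
        MAC_GRAD_IN   /* input gradient:   acc += weight * error      */
    } mac_mode_t;

    static inline int32_t mac_step(int32_t acc, mac_mode_t mode,
                                   int16_t activation, int16_t weight,
                                   int16_t error) {
        switch (mode) {
        case MAC_FWD:     return acc + (int32_t)activation * weight;
        case MAC_GRAD_W:  return acc + (int32_t)activation * error;
        case MAC_GRAD_IN: return acc + (int32_t)weight * error;
        }
        return acc;
    }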
ISBN: 979-8-3315-1042-8
Files in this item:

TinyCL.pdf
Description: Final
Type: 2a Post-print, publisher's version / Version of Record
License: Non-public - private/restricted access
Size: 1.36 MB
Format: Adobe PDF (restricted access; copy available on request)

IJCNN_25_TinyCL_pdfexpress.pdf
Description: Author version
Type: 2. Post-print / Author's Accepted Manuscript
License: Public - all rights reserved
Size: 677.15 kB
Format: Adobe PDF (open access)
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this item: https://hdl.handle.net/11583/3005207