AdaKron: An Adapter-based Parameter Efficient Model Tuning with Kronecker Product

Braga, Marco; Raganato, Alessandro; Pasi, Gabriella

The fine-tuning paradigm has been widely adopted to train neural models tailored for specific tasks. However, the recent upsurge of Large Language Models (LLMs), characterized by billions of parameters, has introduced profound computational challenges to the fine-tuning process. This has fueled intensive research on Parameter-Efficient Fine-Tuning (PEFT) techniques, usually involving the training of a selective subset of the original model parameters. One of the most used approaches is Adapters, which add trainable lightweight layers to the existing pretrained weights. Within this context, we propose AdaKron, an Adapter-based fine-tuning method with the Kronecker product. In particular, we leverage the Kronecker product to combine the output of two small networks, resulting in a final vector whose dimension is the product of the dimensions of the individual outputs, allowing us to train only 0.55% of the model’s original parameters. We evaluate AdaKron by performing a series of experiments on the General Language Understanding Evaluation (GLUE) benchmark, achieving results in the same ballpark as recent state-of-the-art PEFT methods, despite training fewer parameters.
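
The dimension argument in the abstract can be illustrated with a minimal sketch. It assumes the two small networks are plain linear maps with no bias or activation, which the abstract does not specify; the helper names `linear` and `kron_vec` and the toy weights `W1` and `W2` are hypothetical and are not taken from the paper.

```python
def linear(weights, x):
    """Apply a weight matrix (given as a list of rows) to a vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def kron_vec(a, b):
    """Kronecker product of two vectors: each a[i] scales a full copy of b."""
    return [ai * bj for ai in a for bj in b]


# Toy input and two hypothetical small projections (illustrative values only).
x = [1.0, 2.0, 3.0, 4.0]
W1 = [[1, 0, 0, 0], [0, 1, 0, 0]]                # maps 4 -> 2
W2 = [[0, 0, 1, 0], [0, 0, 0, 1], [1, 1, 1, 1]]  # maps 4 -> 3

u = linear(W1, x)   # output of the first small network, length 2
v = linear(W2, x)   # output of the second small network, length 3
z = kron_vec(u, v)  # combined vector, length 2 * 3 = 6

# Weights needed by the two small maps versus one direct 4 -> 6 map.
factored_weights = len(W1) * len(x) + len(W2) * len(x)
direct_weights = len(z) * len(x)
```

In this toy setting the two small maps hold 20 weights, against 24 for a single direct map to the same output size. The gap widens as the dimensions grow, because the output size is a product of the two factor sizes while the weight count is a sum over them.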

AdaKron: An Adapter-based Parameter Efficient Model Tuning with Kronecker Product / Braga, M., Raganato, A., Pasi, G. - (2024), pp. 350-357. (LREC-COLING 2024, Torino (ITA), 20-25 May 2024).

Year of publication: 2024

Publication type: 4.1 Contribution in conference proceedings

Files in this record:

File	Size	Format
2024.lrec-main.32.pdf (open access; type: 2a Post-print, publisher's version / Version of Record; license: Creative Commons)	226.96 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11583/2989005

PORTO @ Archivio Istituzionale della Ricerca
