ferret: a Framework for Benchmarking Explainers on Transformers

Attanasio, Giuseppe; Pastor, Eliana; Di Bonaventura, Chiara; Nozza, Debora

As Transformers are increasingly relied upon to solve complex NLP problems, there is an increased need for their decisions to be humanly interpretable. While several explainable AI (XAI) techniques for interpreting the outputs of transformer-based models have been proposed, there is still a lack of easy access to using and comparing them. We introduce ferret, a Python library to simplify the use and comparisons of XAI methods on transformer-based classifiers. With ferret, users can visualize and compare transformers-based models output explanations using state-of-the-art XAI methods on any free-text or existing XAI corpora. Moreover, users can also evaluate ad-hoc XAI metrics to select the most faithful and plausible explanations. To align with the recently consolidated process of sharing and using transformers-based models from Hugging Face, ferret interfaces directly with its Python library. In this paper, we showcase ferret to benchmark XAI methods used on transformers for sentiment analysis and hate speech detection. We show how specific methods provide consistently better explanations and are preferable in the context of transformer models.

ferret: a Framework for Benchmarking Explainers on Transformers / Attanasio, Giuseppe; Pastor, Eliana; Di Bonaventura, Chiara; Nozza, Debora. - ELETTRONICO. - (2023), pp. 256-266. ( European Chapter of the Association for Computational Linguistics Dubrovnik, Croatia 2-6 May, 2023).

ferret: a Framework for Benchmarking Explainers on Transformers

Attanasio, Giuseppe;Pastor, Eliana;Di Bonaventura, Chiara;Nozza, Debora

2023

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Anno del prodotto

2023

Appare nelle tipologie

4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
ferret_EACL23_Demo.pdf accesso riservato Descrizione: Articolo principale (postprint referato) Tipologia: 2. Post-print / Author's Accepted Manuscript Licenza: Non Pubblico - Accesso privato/ristretto Dimensione 420.92 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	420.92 kB	Adobe PDF	Visualizza/Apri Richiedi una copia
2023.eacl-demo.29.pdf accesso aperto Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: Creative commons Dimensione 425.4 kB Formato Adobe PDF Visualizza/Apri	425.4 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2976777

PORTO @ Archivio Istituzionale della Ricerca

ferret: a Framework for Benchmarking Explainers on Transformers

Attanasio, Giuseppe;Pastor, Eliana;Di Bonaventura, Chiara;Nozza, Debora

2023

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)