Exploring Generative Error Correction for Dysarthric Speech Recognition / La Quatra, Moreno; Koudounas, Alkis; Salerno, Valerio Mario; Siniscalchi, Sabato Marco. - (2025). (Paper presented at Interspeech 2025, held in Rotterdam (NL), 17-21 August 2025).

Exploring Generative Error Correction for Dysarthric Speech Recognition

Alkis Koudounas;
2025

Abstract

Despite the remarkable progress in end-to-end Automatic Speech Recognition (ASR) engines, accurately transcribing dysarthric speech remains a major challenge. In this work, we propose a two-stage framework for the Speech Accessibility Project Challenge at INTERSPEECH 2025, which combines cutting-edge speech recognition models with LLM-based generative error correction (GER). We assess different configurations of model scales and training strategies, incorporating a dedicated hypothesis-selection step to improve transcription accuracy. Experiments on the Speech Accessibility Project dataset demonstrate the strength of our approach on structured and spontaneous speech, while highlighting challenges in single-word recognition. Through comprehensive analysis, we provide insights into the complementary roles of acoustic and linguistic modeling in dysarthric speech recognition.
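
To make the two-stage structure concrete, the minimal Python sketch below shows how an N-best list from a first-stage recognizer could be filtered and packed into a correction prompt for an LLM. The function names, prompt wording, score-based selection rule, and example hypotheses are illustrative assumptions for this sketch, not the configuration used in the paper.

```python
# Minimal sketch of a two-stage ASR + LLM generative error correction (GER)
# pipeline. The helper names, prompt text, and example hypotheses are
# illustrative assumptions, not the authors' actual implementation.

from typing import List


def build_ger_prompt(hypotheses: List[str]) -> str:
    """Format an N-best hypothesis list into a correction prompt for an LLM."""
    numbered = "\n".join(f"{i + 1}. {h}" for i, h in enumerate(hypotheses))
    return (
        "The following are candidate transcriptions of the same utterance, "
        "produced by a speech recognizer. Some words may be misrecognized.\n"
        f"{numbered}\n"
        "Return the single most likely correct transcription."
    )


def select_hypotheses(hypotheses: List[str], scores: List[float], k: int = 5) -> List[str]:
    """Keep the k highest-scoring hypotheses (a simple stand-in for the
    hypothesis-selection step mentioned in the abstract)."""
    ranked = sorted(zip(hypotheses, scores), key=lambda pair: pair[1], reverse=True)
    return [hyp for hyp, _ in ranked[:k]]


if __name__ == "__main__":
    # Stage 1 (assumed): an ASR model returns an N-best list with log-prob scores.
    nbest = [
        "turn on the living room lights",
        "turn on the living room light",
        "turn on the leaving room lights",
    ]
    scores = [-1.2, -1.5, -2.3]

    # Stage 2: select hypotheses and build the prompt sent to the LLM for correction.
    prompt = build_ger_prompt(select_hypotheses(nbest, scores, k=2))
    print(prompt)
```

In a full pipeline, the printed prompt would be passed to the LLM, whose output replaces the first-stage transcript; the selection rule here is a plain score ranking, whereas the paper's hypothesis selection may use other criteria.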
Files in this record:

File: 2505.20163v1.pdf
Access: open access
Type: 2. Post-print / Author's Accepted Manuscript
License: Public - All rights reserved
Size: 291.14 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11583/3002219