Deep Learning-Based Lip-Reading for Vocal Impaired Patient Rehabilitation

Innocente, Chiara; Boemio, Matteo; Lorenzetti, Gianmarco; Pulito, Ilaria; Romagnoli, Diego; Saponaro, Valeria; Marullo, Giorgia; Ulrich, Luca; Vezzetti, Enrico

doi:10.32604/cmes.2025.063186

Lip-reading technology, based on visual speech decoding and automatic speech recognition, offers a promising solution to overcoming communication barriers, particularly for individuals with temporary or permanent speech impairments. However, most Visual Speech Recognition (VSR) research has primarily focused on the English language and general-purpose applications, limiting its practical applicability in medical and rehabilitative settings. This study introduces the first Deep Learning (DL) based lip-reading system for the Italian language, designed to assist individuals with vocal cord pathologies in daily interactions, facilitating communication for patients recovering from vocal cord surgeries, whether temporarily or permanently impaired. To ensure relevance and effectiveness in real-world scenarios, a carefully curated vocabulary of twenty-five Italian words was selected, encompassing critical semantic fields such as Needs, Questions, Answers, Emergencies, Greetings, Requests, and Body Parts. These words were chosen to address both essential daily communication and urgent medical assistance requests. Our approach combines a spatiotemporal Convolutional Neural Network (CNN) with a bidirectional Long Short-Term Memory (BiLSTM) recurrent network, and a Connectionist Temporal Classification (CTC) loss function to recognize individual words, without requiring predefined words boundaries. The experimental results demonstrate the system’s robust performance in recognizing target words, reaching an average accuracy of 96.4% in individual word recognition, suggesting that the system is particularly well-suited for offering support in constrained clinical and caregiving environments, where quick and reliable communication is critical. In conclusion, the study highlights the importance of developing language-specific, application-driven VSR solutions, particularly for non-English languages with limited linguistic resources. By bridging the gap between deep learning-based lip-reading and real-world clinical needs, this research advances assistive communication technologies, paving the way for more inclusive and medically relevant applications of VSR in rehabilitation and healthcare.

Deep Learning-Based Lip-Reading for Vocal Impaired Patient Rehabilitation / Innocente, C., Boemio, M., Lorenzetti, G., Pulito, I., Romagnoli, D., Saponaro, V., Marullo, G., Ulrich, L., Vezzetti, E.. - In: COMPUTER MODELING IN ENGINEERING & SCIENCES. - ISSN 1526-1506. - ELETTRONICO. - (2025). [10.32604/cmes.2025.063186]

Deep Learning-Based Lip-Reading for Vocal Impaired Patient Rehabilitation

Innocente, Chiara;Boemio, Matteo;Lorenzetti, Gianmarco;Pulito, Ilaria;Romagnoli, Diego;Saponaro, Valeria;Marullo, Giorgia;Ulrich, Luca;Vezzetti, Enrico

2025

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2025
			
	Codice DOI
	
				https://dx.doi.org/10.32604/cmes.2025.063186
			
	Titolo della Rivista
	
				COMPUTER MODELING IN ENGINEERING & SCIENCES
			
	Appare nelle tipologie
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
TSP_CMES_63186.pdf accesso aperto Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: Creative commons Dimensione 1.22 MB Formato Adobe PDF Visualizza/Apri	1.22 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2999418

PORTO @ Archivio Istituzionale della Ricerca

Deep Learning-Based Lip-Reading for Vocal Impaired Patient Rehabilitation

Innocente, Chiara;Boemio, Matteo;Lorenzetti, Gianmarco;Pulito, Ilaria;Romagnoli, Diego;Saponaro, Valeria;Marullo, Giorgia;Ulrich, Luca;Vezzetti, Enrico

2025

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)