Vocal-Feature-Based Classification of Post-Laryngectomy Patients for Rehabilitation Monitoring / Carullo, Alessio; Vallan, Alberto; Fantini, Marco; Succo, Giovanni. - In: IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT. - ISSN 0018-9456. - PRINT. - 72:(2023), pp. 1-9. [10.1109/TIM.2023.3277947]

Vocal-Feature-Based Classification of Post-Laryngectomy Patients for Rehabilitation Monitoring

Alessio Carullo; Alberto Vallan
2023

Abstract

This article analyzes substitution voices in patients who underwent partial laryngectomy for laryngeal cancer, with the aim of identifying a reliable methodology for objectively evaluating post-intervention phonatory impairment and the effectiveness of rehabilitation therapies. The investigated dataset includes 85 patients who underwent open partial horizontal laryngectomy (OPHL): type I (22 subjects), type II (32 subjects), and type III (31 subjects). The available vocal material (a reading task and a sustained vowel) was preprocessed with two different algorithms to remove nonharmonic frames from the patients' recordings. After this preliminary step, a series of features in the time, spectral, and cepstral domains were extracted from the selected harmonic frames. Two comparisons were then made: OPHL-I versus OPHL-II + III, and OPHL-II + III (I < 5) versus OPHL-II + III (I >= 5), where I is the Intelligibility index of the intelligibility, noise, fluency, and voicing (INFVo) auditory-perceptual scale, assessed during a preliminary evaluation. Two feature-selection techniques, one based on comparing the probability distributions of the extracted features and the other on the classification performance of a logistic regression (LR) model, identified the features with the best discrimination capabilities: harmonic-to-noise ratio (HNR), fundamental frequency, spectral kurtosis, spectral entropy, and mel-frequency cepstral coefficients (MFCCs). The best classification accuracy, 96.5% with fivefold cross-validation, was obtained in the comparison OPHL-I versus OPHL-II + III using an LR model trained on the 5th and 95th percentiles of the fundamental frequency and the 95th percentile of the spectral entropy, all extracted from the reading task.
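As a rough illustration of the three features behind the best reported result, the sketch below computes the 5th and 95th percentiles of a fundamental-frequency track and the 95th percentile of per-frame spectral entropy, and builds a fivefold partition of the 85 subjects. It is a minimal sketch, not the authors' implementation: it assumes the fundamental-frequency values and per-frame power spectra have already been extracted from harmonic frames, it uses linear-interpolation percentiles and base-2 Shannon entropy as assumed definitions, and every function name is hypothetical. The LR model itself is not reproduced here.

```python
import math
import random


def spectral_entropy(power):
    """Shannon entropy (in bits) of a power spectrum normalised to sum to 1.

    Assumed definition; the article may use a different normalisation.
    """
    total = sum(power)
    if total <= 0:
        return 0.0
    probs = [p / total for p in power if p > 0]
    return -sum(p * math.log2(p) for p in probs)


def percentile(values, q):
    """q-th percentile (0..100), linear interpolation between order statistics."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile of an empty sequence")
    rank = (len(ordered) - 1) * q / 100.0
    lo, hi = math.floor(rank), math.ceil(rank)
    frac = rank - lo
    return ordered[lo] * (1 - frac) + ordered[hi] * frac


def recording_features(f0_track, spectra):
    """Return [F0 5th pct, F0 95th pct, spectral-entropy 95th pct] for one recording.

    f0_track: fundamental-frequency values (Hz), one per harmonic frame.
    spectra:  one power spectrum (list of non-negative floats) per harmonic frame.
    """
    entropies = [spectral_entropy(s) for s in spectra]
    return [
        percentile(f0_track, 5),
        percentile(f0_track, 95),
        percentile(entropies, 95),
    ]


def kfold_indices(n, k=5, seed=0):
    """Shuffle subject indices 0..n-1 and deal them into k disjoint folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


# Fivefold partition of the 85 subjects: each fold serves once as the test set.
folds = kfold_indices(85, k=5)
```

In a full pipeline, each fold would be held out in turn while a classifier is fitted to the feature vectors of the remaining four folds, and the reported accuracy would be the fraction of held-out subjects classified correctly across all five folds.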

Use this identifier to cite or link to this document: https://hdl.handle.net/11583/2990172