Machine learning- and statistical-based voice analysis of Parkinson's disease patients: A survey / Amato, F.; Saggio, G.; Cesarini, V.; Olmo, G.; Costantini, G. - In: EXPERT SYSTEMS WITH APPLICATIONS. - ISSN 0957-4174. - 219:(2023), p. 119651. [10.1016/j.eswa.2023.119651]
Machine learning- and statistical-based voice analysis of Parkinson's disease patients: A survey
Amato F.;Olmo G.;
2023
Abstract
The preliminary diagnosis and evaluation of the presence and/or severity of Parkinson’s disease is crucial in controlling the progress of the disease. Real-time, non-invasive methodologies based on machine learning-enhanced voice analysis are attracting growing interest as the potential of this field becomes apparent. Specifically, acoustic features are employed in many machine learning techniques and could also function as indicators of the overall state of the subjects’ voice. This review aims to identify the most widely employed and promising feature-based machine learning methodologies, highlighting baselines and state-of-the-art solutions. A total of 102 works plus 5 review articles were selected from the IEEE Xplore, PubMed, Elsevier, and Web of Science electronic databases. A statistical assessment identifies the most frequently used features as well as those deemed most effective; an overview of algorithms, public datasets, toolboxes, and general metadata is also provided. According to our results, Jitter, Shimmer, Harmonic-to-Noise Ratio, Fundamental Frequency, and Mel Frequency Cepstral Coefficients are the most widely adopted features. In addition, it is worth noting a fair prevalence of glottal-like models and additional filtering options, such as Detrended Fluctuation Analysis.

| File | Access | Type | License | Size | Format |
|---|---|---|---|---|---|
| 1-s2.0-S0957417423001525-main.pdf | Restricted access | 2a Post-print publisher's version / Version of Record | Not public - Private/restricted access | 948.88 kB | Adobe PDF |
| ESWA-D-22-05921_R1.pdf | Open Access from 03/02/2025 | 2. Post-print / Author's Accepted Manuscript | Creative Commons | 1.17 MB | Adobe PDF |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.
https://hdl.handle.net/11583/2976268