DocumentCode :
838653
Title :
Telephony-based voice pathology assessment using automated speech analysis
Author :
Moran, Rosalyn J. ; Reilly, Richard B. ; De Chazal, Philip ; Lacy, Peter D.
Author_Institution :
Dept. of Electron. & Electr. Eng., Univ. Coll. Dublin, Ireland
Volume :
53
Issue :
3
fYear :
2006
fDate :
3/1/2006 12:00:00 AM
Firstpage :
468
Lastpage :
477
Abstract :
A system for remotely detecting vocal fold pathologies using telephone-quality speech is presented. The system uses a linear classifier, processing measurements of pitch perturbation, amplitude perturbation and harmonic-to-noise ratio derived from digitized speech recordings. Voice recordings from the Disordered Voice Database Model 4337 system were used to develop and validate the system. Results show that while a sustained phonation, recorded in a controlled environment, can be classified as normal or pathologic with accuracy of 89.1%, telephone-quality speech can be classified as normal or pathologic with an accuracy of 74.2%, using the same scheme. Amplitude perturbation features prove most robust for telephone-quality speech. The pathologic recordings were then subcategorized into four groups, comprising normal, neuromuscular pathologic, physical pathologic and mixed (neuromuscular with physical) pathologic. A separate classifier was developed for classifying the normal group from each pathologic subcategory. Results show that neuromuscular disorders could be detected remotely with an accuracy of 87%, physical abnormalities with an accuracy of 78% and mixed pathology voice with an accuracy of 61%. This study highlights the real possibility for remote detection and diagnosis of voice pathology.
Keywords :
medical signal processing; signal classification; speech; speech processing; telephony; Disordered Voice Database Model 4337 system; amplitude perturbation; automated speech analysis; digitized speech recordings; harmonic-to-noise ratio; linear classifier; mixed pathologic recordings; neuromuscular pathologic recordings; physical pathologic recordings; pitch perturbation; sustained phonation; telephony-based voice pathology assessment; vocal fold pathologies; Cepstral analysis; Energy measurement; Hospitals; Mel frequency cepstral coefficient; Neuromuscular; Noise measurement; Pathology; Speech analysis; Speech processing; Time measurement; Speech analysis; voice pathology; voiceXML; Algorithms; Artificial Intelligence; Diagnosis, Computer-Assisted; Humans; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Sound Spectrography; Speech Disorders; Speech Production Measurement; Speech Recognition Software; Telemedicine; Telephone;
fLanguage :
English
Journal_Title :
Biomedical Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9294
Type :
jour
DOI :
10.1109/TBME.2005.869776
Filename :
1597497
Link To Document :
بازگشت