Title :
Discrimination of pathological voices using a time-frequency approach
Author :
Umapathy, Karthikeyan ; Krishnan, Sridhar ; Parsa, Vijay ; Jamieson, Donald G.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Western Ontario, London, Ont., Canada
fDate :
3/1/2005 12:00:00 AM
Abstract :
Acoustical measures of vocal function are routinely used in the assessments of disordered voice, and for monitoring the patient\´s progress over the course of voice therapy. Typically, acoustic measures are extracted from sustained vowel stimuli where short-term and long-term perturbations in fundamental frequency and intensity, and the level of "glottal noise" are used to characterize the vocal function. However, acoustic measures extracted from continuous speech samples may well be required for accurate prediction of abnormal voice quality that is relevant to the client\´s "real world" experience. In contrast with sustained vowel research, there is relatively sparse literature on the effectiveness of acoustic measures extracted from continuous speech samples. This is partially due to the challenge of segmenting the speech signal into voiced, unvoiced, and silence periods before features can be extracted for vocal function characterization. We propose a joint time-frequency approach for classifying pathological voices using continuous speech signals that obviates the need for such segmentation. The speech signals were decomposed using an adaptive time-frequency transform algorithm, and several features such as the octave max, octave mean, energy ratio, length ratio, and frequency ratio were extracted from the decomposition parameters and analyzed using statistical pattern classification techniques. Experiments with a database consisting of continuous speech samples from 51 normal and 161 pathological talkers yielded a classification accuracy of 93.4%.
Keywords :
bioacoustics; feature extraction; medical signal processing; pattern classification; signal classification; speech; speech processing; time-frequency analysis; acoustical measures; adaptive time-frequency transform; continuous speech samples; disordered voice; energy ratio; feature extraction; frequency ratio; glottal noise; length ratio; long-term perturbations; octave max; octave mean; pathological voices; short-term perturbations; speech signal decomposition; speech signal segmentation; statistical pattern classification; vocal function; voice therapy; Acoustic measurements; Acoustic noise; Frequency measurement; Medical treatment; Noise level; Noise measurement; Pathology; Patient monitoring; Speech analysis; Time frequency analysis; Matching pursuit; pathological voice; pattern classification; speech disorders; time-frequency distributions; Algorithms; Databases, Factual; Diagnosis, Computer-Assisted; Fourier Analysis; Humans; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Sound Spectrography; Speech Disorders; Speech Production Measurement; Time Factors; Voice Disorders;
Journal_Title :
Biomedical Engineering, IEEE Transactions on
DOI :
10.1109/TBME.2004.842962