DocumentCode :
1340747
Title :
A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment
Author :
Hansen, John H. L. ; Gavidia-Ceballos, Liliana ; Kaiser, James F.
Author_Institution :
Dept. of Electr. & Biomed. Eng., Duke Univ., Durham, NC, USA
Volume :
45
Issue :
3
fYear :
1998
fDate :
3/1/1998
Firstpage :
300
Lastpage :
313
Abstract :
Traditional speech processing methods for laryngeal pathology assessment assume linear speech production with measures derived from an estimated glottal flow waveform. They normally require the speaker to achieve complete glottal closure, which for many vocal fold pathologies cannot be accomplished. To address this issue, a nonlinear signal processing approach is proposed which does not require direct glottal flow waveform estimation. This technique is motivated by earlier studies of airflow characterization for human speech production. The proposed nonlinear approach employs a differential Teager energy operator and the energy separation algorithm to obtain formant AM and FM modulations from filtered speech recordings. A new speech measure is proposed based on parameterization of the autocorrelation envelope of the AM response. This approach is shown to achieve impressive detection performance for a set of muscular tension dysphonias. Unlike flow characterization using numerical solutions of Navier-Stokes equations, this method is extremely computationally attractive, requiring only a small time window of speech samples. The new noninvasive method shows that a fast, effective digital speech processing technique can be developed for vocal fold pathology assessment without the need for direct glottal flow estimation or complete glottal closure by the speaker. The proposed method also confirms that alternative nonlinear methods can begin to address the limitations of previous linear approaches for speech pathology assessment.
Keywords :
medical signal processing; nonlinear acoustics; patient diagnosis; speech processing; airflow characterization; complete glottal closure; differential Teager energy operator; energy separation algorithm; estimated glottal flow waveform; filtered speech recordings; laryngeal pathology assessment; linear speech production; muscular tension dysphonias; nonlinear operator-based speech feature analysis method; nonlinear signal processing approach; vocal fold pathology assessment; Acoustic measurements; Biomedical engineering; Biomedical measurements; Fluid flow measurement; Noise measurement; Pathology; Robustness; Signal processing algorithms; Speech analysis; Speech processing; Adult; Algorithms; Elasticity; Female; Humans; Male; Muscle Contraction; Nonlinear Dynamics; Signal Processing, Computer-Assisted; Speech Discrimination Tests; Speech Production Measurement; Vocal Cords; Voice Disorders;
fLanguage :
English
Journal_Title :
Biomedical Engineering, IEEE Transactions on
Publisher :
IEEE
ISSN :
0018-9294
Type :
jour
DOI :
10.1109/10.661155
Filename :
661155