DocumentCode :
336952
Title :
Methods for stress classification: nonlinear TEO and linear speech based features
Author :
Zhou, Guojun ; Hansen, John H L ; Kaiser, James F.
Author_Institution :
Robust Speech Process. Lab., Duke Univ., Durham, NC, USA
Volume :
4
fYear :
1999
fDate :
15-19 Mar 1999
Firstpage :
2087
Abstract :
Speech production variations due to perceptually induced stress contribute significantly to reduced speech processing performance. One approach that can improve the robustness of speech processing (e.g., recognition) algorithms against stress is to formulate an objective classification of speaker stress based upon the acoustic speech signal. An overview of methods for stress classification is presented. First, we review traditional pitch-based methods for stress detection and classification. Second, neural network based stress classifiers with cepstral-based features, as well as wavelet-based classification algorithms are considered. The effect of stress on linear speech features is discussed, followed by the application of linear features and the Teager (1990) energy operator (TEO) based nonlinear features for effective stress classification. A new evaluation for stress classification and assessment is presented using a critical band frequency partition based the TEO feature and the combination of several linear features. Results using NATO databases of actual speech under stress are presented. Finally, we discuss issues relating to stress classification across known and unknown speakers and suggest areas for further research
Keywords :
acoustic signal processing; cepstral analysis; feature extraction; mathematical operators; neural nets; physiology; signal classification; speech processing; speech recognition; NATO databases; Teager energy operator; acoustic speech signal; cepstral-based features; critical band frequency partition; linear speech based features; linear speech features; neural network based stress classifiers; nonlinear TEO; objective classification; perceptually induced stress; pitch-based methods; research; speech processing performance; speech production variations; speech recognition algorithms; stress classification; stress detection; wavelet-based classification algorithms; Acoustic signal detection; Classification algorithms; Loudspeakers; Neural networks; Partitioning algorithms; Robustness; Signal processing; Speech processing; Speech recognition; Stress;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
ISSN :
1520-6149
Print_ISBN :
0-7803-5041-3
Type :
conf
DOI :
10.1109/ICASSP.1999.758344
Filename :
758344
Link To Document :
بازگشت