• DocumentCode
    336952
  • Title

    Methods for stress classification: nonlinear TEO and linear speech based features

  • Author

    Zhou, Guojun ; Hansen, John H L ; Kaiser, James F.

  • Author_Institution
    Robust Speech Process. Lab., Duke Univ., Durham, NC, USA
  • Volume
    4
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    2087
  • Abstract
    Speech production variations due to perceptually induced stress contribute significantly to reduced speech processing performance. One approach that can improve the robustness of speech processing (e.g., recognition) algorithms against stress is to formulate an objective classification of speaker stress based upon the acoustic speech signal. An overview of methods for stress classification is presented. First, we review traditional pitch-based methods for stress detection and classification. Second, neural network based stress classifiers with cepstral-based features, as well as wavelet-based classification algorithms are considered. The effect of stress on linear speech features is discussed, followed by the application of linear features and the Teager (1990) energy operator (TEO) based nonlinear features for effective stress classification. A new evaluation for stress classification and assessment is presented using a critical band frequency partition based the TEO feature and the combination of several linear features. Results using NATO databases of actual speech under stress are presented. Finally, we discuss issues relating to stress classification across known and unknown speakers and suggest areas for further research
  • Keywords
    acoustic signal processing; cepstral analysis; feature extraction; mathematical operators; neural nets; physiology; signal classification; speech processing; speech recognition; NATO databases; Teager energy operator; acoustic speech signal; cepstral-based features; critical band frequency partition; linear speech based features; linear speech features; neural network based stress classifiers; nonlinear TEO; objective classification; perceptually induced stress; pitch-based methods; research; speech processing performance; speech production variations; speech recognition algorithms; stress classification; stress detection; wavelet-based classification algorithms; Acoustic signal detection; Classification algorithms; Loudspeakers; Neural networks; Partitioning algorithms; Robustness; Signal processing; Speech processing; Speech recognition; Stress;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.758344
  • Filename
    758344