• DocumentCode
    2853699
  • Title

    Neural Networks and TEO Features for an Automatic Recognition of Stress in Spontaneous Speech

  • Author

    He, Ling ; Lech, Margaret ; Maddage, Namunu C. ; Allen, Nicholas

  • Author_Institution
    Sch. of Electr. & Comput. Eng., RMIT Univ., Melbourne, VIC, Australia
  • Volume
    2
  • fYear
    2009
  • fDate
    14-16 Aug. 2009
  • Firstpage
    227
  • Lastpage
    231
  • Abstract
    This study presents automatic stress recognition methods based on acoustic speech analysis. Novel approaches to feature extraction based on the nonlinear Teager energy operator (TEO) calculated within critical bands, discrete wavelet transform bands, and wavelet packet bands are presented. The classification process was performed using two types of neural networks: the multilayer perceptron neural network (MLPNN) and the probabilistic neural network (PNN). The classification efficiency was tested using the actual stress dataset from the SUSAS database. The speech recordings were made by 15 speakers (8 females and 7 males) reading a list of 35 words under three actual conditions: high stress, low stress, and neutral. The best overall performance was observed for the features extracted using the TEO parameters calculated within perceptual wavelet packet bands(TEO-PWP). Depending on the type of mother wavelet, the correct classification scores for the PWP features ranged from 71.24% to 91.56% (using the MLPNN classifier), and from 86.63% to 93.67% (using the PNN). The PNN classifier outperformed the MLPNN classification method.
  • Keywords
    database management systems; discrete wavelet transforms; feature extraction; multilayer perceptrons; speech recognition; MLPNN; SUSAS database; TEO features; automatic recognition; automatic stress recognition methods; discrete wavelet transform; feature extraction; multilayer perceptron neural network; neural networks; nonlinear Teager energy operator; perceptual wavelet packet bands; probabilistic neural network; spontaneous speech stress; stress dataset; wavelet packet bands; Automatic speech recognition; Discrete wavelet transforms; Feature extraction; Multi-layer neural network; Multilayer perceptrons; Neural networks; Speech analysis; Stress; Testing; Wavelet packets; MLPNN; PNN; TEO features; speech classification; stress and emotion recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Computation, 2009. ICNC '09. Fifth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-0-7695-3736-8
  • Type

    conf

  • DOI
    10.1109/ICNC.2009.56
  • Filename
    5365560