• DocumentCode
    2850522
  • Title

    Stress Detection Using Speech Spectrograms and Sigma-pi Neuron Units

  • Author

    He, Ling ; Lech, Margaret ; Maddage, Namunu C. ; Allen, Nicholas

  • Author_Institution
    Sch. of Electr. & Comput. Eng., RMIT Univ., Melbourne, VIC, Australia
  • Volume
    2
  • fYear
    2009
  • fDate
    14-16 Aug. 2009
  • Firstpage
    260
  • Lastpage
    264
  • Abstract
    This paper presents a new system for automatic stress detection in speech. In the process of feature extraction speech spectrograms were used as the primary features. The sigma-pi neuron cells were then employed to derive the secondary features. The analysis was performed at three alternative sets of analytical frequency bands: critical bands, Bark scale bands and equivalent rectangular bandwidth (ERB) scale bands. The presented algorithm was tested using actual stressful speech utterances from SUSAS (Speech Under Simulated and Actual Stress) database on the vowel-based level. The automatic stress-level classification was implemented using Gaussian mixture model (GMM) and k-nearest neighbor (KNN) classifiers. The strongest effect on the classification results was observed when selecting the type of frequency bands. The ERB scale provided the highest classification results ranging from 67.84% to 73.76%. The classification results did not differ between data sets containing specific types of vowels and data sets containing mixtures of vowels. This indicates that the proposed method can be applied to voiced speech in speech independent conditions.
  • Keywords
    Gaussian processes; feature extraction; pattern classification; speech processing; Bark scale bands; Gaussian mixture model; SUSAS database; Speech Under Simulated and Actual Stress database; analytical frequency bands; automatic stress detection; critical bands; equivalent rectangular bandwidth scale bands; feature extraction; k-nearest neighbor classifiers; sigma-pi neuron cells; speech spectrograms; Bandwidth; Feature extraction; Frequency; Neurons; Performance analysis; Spatial databases; Spectrogram; Speech processing; Stress; Testing; sigma-pi neural units; spectrogram analysis; speech classification; stress recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Computation, 2009. ICNC '09. Fifth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-0-7695-3736-8
  • Type

    conf

  • DOI
    10.1109/ICNC.2009.59
  • Filename
    5365356