• DocumentCode
    2915860
  • Title

    Automatic recognition of speech emotion using long-term spectro-temporal features

  • Author

    Wu, Siqing ; Falk, Tiago H. ; Chan, Wai-Yip

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Queen´´s Univ., Kingston, ON, Canada
  • fYear
    2009
  • fDate
    5-7 July 2009
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper proposes a novel feature type for the recognition of emotion from speech. The features are derived from a long-term spectro-temporal representation of speech. They are compared to short-term spectral features as well as popular prosodic features. Experimental results with the Berlin emotional speech database show that the proposed features outperform both types of compared features. An average recognition accuracy of 88.6% is achieved by using a combined proposed & prosodic feature set for classifying 7 discrete emotions. Moreover, the proposed features are evaluated on the VAM corpus to recognize continuous emotion primitives. Estimation performance comparable to human evaluations is furnished.
  • Keywords
    audio databases; emotion recognition; speech recognition; Berlin emotional speech database; continuous emotion primitives recognition; prosodic features; speech emotion; speech long-term spectro-temporal representation; Automatic speech recognition; Band pass filters; Bandwidth; Emotion recognition; Filter bank; Frequency modulation; Humans; Signal resolution; Spatial databases; Speech processing; Emotion recognition; affective computing; spectro-temporal features; speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Signal Processing, 2009 16th International Conference on
  • Conference_Location
    Santorini-Hellas
  • Print_ISBN
    978-1-4244-3297-4
  • Electronic_ISBN
    978-1-4244-3298-1
  • Type

    conf

  • DOI
    10.1109/ICDSP.2009.5201047
  • Filename
    5201047