• DocumentCode
    735056
  • Title

    Speech emotion recognition with i-vector feature and RNN model

  • Author

    Teng Zhang ; Ji Wu

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • fYear
    2015
  • fDate
    12-15 July 2015
  • Firstpage
    524
  • Lastpage
    528
  • Abstract
    Machine-based emotion recognition from speech has emerged as an important research area in recent years. However, most studies have been done on artificial data. The difficulty of the recognition task increases when we facing natural speech data such as real-world conversations from call centre. Along with that difficulty, there are some new properties which may be useful to the real-world recognition tasks. In this paper, we focus on the recognition task on real-world conversations. Traditional prosodic acoustic features and the novel i-vector features are introduced and compared to represent the speech signal more abstractly. We also propose a Recurrent Neural Network approach to map the features to emotion labels. With only prosodic acoustic features and SVM multi-clasifier, we obtain a f-measure of 38.3%. By adding the i-vector features and the RNN model, we achieve a better result of 48.9%.
  • Keywords
    acoustic signal processing; emotion recognition; feature extraction; recurrent neural nets; signal classification; speech recognition; statistical analysis; support vector machines; RNN model; SVM multiclassifier; call centre; emotion labels; f-measure; i-vector feature; machine-based emotion recognition; natural speech data; prosodic acoustic features; real-world conversations; real-world recognition tasks; recurrent neural network; speech emotion recognition; speech signal representation; Acoustics; Databases; Emotion recognition; Feature extraction; Recurrent neural networks; Speech; Speech recognition; Emotion recognition; Recurrent neural networks; Speech analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on
  • Conference_Location
    Chengdu
  • Type

    conf

  • DOI
    10.1109/ChinaSIP.2015.7230458
  • Filename
    7230458