• DocumentCode
    3530258
  • Title

    Emotion recognition from speech: Putting ASR in the loop

  • Author

    Schuller, Björn ; Batliner, Anton ; Steidl, Stefan ; Seppi, Dino

  • Author_Institution
    Inst. for Human-Machine Commun., Tech. Univ. Munchen, Munchen
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4585
  • Lastpage
    4588
  • Abstract
    This paper investigates the automatic recognition of emotion from spoken words by vector space modeling vs. string kernels which have not been investigated in this respect, yet. Apart from the spoken content directly, we integrate part-of-speech and higher semantic tagging in our analyses. As opposed to most works in the field, we evaluate the performance with an ASR engine in the loop. Extensive experiments are run on the FAU Aibo Emotion Corpus of 4 k spontaneous emotional child-robot interactions and show surprisingly low performance degradation with real ASR over transcription-based emotion recognition. In the result, bag of words dominate over all other modeling forms based on the spoken content.
  • Keywords
    emotion recognition; speech recognition; ASR engine; FAU Aibo Emotion Corpus; automatic emotion recognition; part-of-speech; semantic tagging; spoken words; spontaneous emotional child-robot interactions; string kernels; transcription-based emotion recognition; vector space modeling; Automatic speech recognition; Emotion recognition; Frequency; Kernel; Man machine systems; Space technology; Speech analysis; Speech recognition; Support vector machines; Vocabulary; Feature extraction; Speech analysis; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960651
  • Filename
    4960651