  • DocumentCode
    3153791
  • Title
    A hierarchical framework for modeling multimodality and emotional evolution in affective dialogs
  • Author
    Metallinou, Angeliki; Katsamanis, Athanasios; Narayanan, Shrikanth
  • Author_Institution
    Signal Anal. & Interpretation Lab. (SAIL), Univ. of Southern California, Los Angeles, CA, USA
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    2401
  • Lastpage
    2404
  • Abstract
    Incorporating multimodal information and temporal context from speakers during an emotional dialog can improve the performance of automatic emotion recognition systems. Motivated by this, we propose a hierarchical framework that models emotional evolution within and between emotional utterances, i.e., at the utterance and dialog levels, respectively. Our approach can incorporate a variety of generative or discriminative classifiers at each level and offers flexibility and extensibility in multimodal fusion: facial, vocal, head, and hand movement cues can be included and fused according to the modality and the emotion classification task. Our results on the multimodal, multi-speaker IEMOCAP database indicate that this framework is well suited to cases where emotions are expressed multimodally and in context, as in many real-life situations.
  • Keywords
    emotion recognition; interactive systems; pattern classification; speaker recognition; automatic emotion recognition system; discriminative classifier; emotion classification task; emotional dialog; emotional evolution; emotional utterance; facial movement cue; generative classifier; hand movement cue; head movement cue; multimodal fusion; multimodal information modeling; multimodal multispeaker IEMOCAP database; speaker temporal context; vocal movement cue; Context; Context modeling; Databases; Emotion recognition; Face; Hidden Markov models; Vectors; dialog modeling; discriminative training; emotion recognition; hierarchical HMM; multimodality;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2012.6288399
  • Filename
    6288399