A hierarchical framework for modeling multimodality and emotional evolution in affective dialogs

Author

Metallinou, Angeliki ; Katsamanis, Athanasios ; Narayanan, Shrikanth

Author_Institution

Signal Anal. & Interpretation Lab. (SAIL), Univ. of Southern California, Los Angeles, CA, USA

fYear

2012

fDate

25-30 March 2012

Firstpage

2401

Lastpage

2404

Abstract

Incorporating multimodal information and temporal context from speakers during an emotional dialog can contribute to improving performance of automatic emotion recognition systems. Motivated by these issues, we propose a hierarchical framework which models emotional evolution within and between emotional utterances, i.e., at the utterance and dialog level respectively. Our approach can incorporate a variety of generative or discriminative classifiers at each level and provides flexibility and extensibility in terms of multimodal fusion; facial, vocal, head and hand movement cues can be included and fused according to the modality and the emotion classification task. Our results using the multimodal, multi-speaker IEMOCAP database indicate that this framework is well-suited for cases where emotions are expressed multimodally and in context, as in many real-life situations.

Keywords

emotion recognition; interactive systems; pattern classification; speaker recognition; automatic emotion recognition system; discriminative classifier; emotion classification task; emotional dialog; emotional evolution; emotional utterance; facial movement cue; generative classifier; hand movement cue; head movement cue; multimodal fusion; multimodal information modeling; multimodal multispeaker IEMOCAP database; speaker temporal context; vocal movement cue; Context; Context modeling; Databases; Emotion recognition; Face; Hidden Markov models; Vectors; dialog modeling; discriminative training; emotion recognition; hierarchical HMM; multimodality;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location

Kyoto

ISSN

1520-6149

Print_ISBN

978-1-4673-0045-2

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2012.6288399

Filename

6288399