• DocumentCode
    615113
  • Title

    Feature and model level compensation of lexical content for facial emotion recognition

  • Author

    Mariooryad, S. ; Busso, Carlos

  • Author_Institution
    Electr. Eng. Dept., Univ. of Texas Dallas, Richardson, TX, USA
  • fYear
    2013
  • fDate
    22-26 April 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Along with emotions, modulation of the lexical content is an integral aspect of spontaneously produced facial expressions. Hence, the verbal content introduces an undesired variability for solving the facial emotion recognition problem, especially in continuous frame-by-frame analysis during spontaneous human interactions. This study proposes feature and model level compensation approaches to address this problem. The feature level compensation scheme builds upon a trajectory-based modeling of facial features and the whitening transformation of the trajectories. The approach aims to normalize the lexicon-dependent patterns observed in the trajectories. The model level compensation approach builds viseme-dependent emotional classifiers to incorporate the lexical variability. The emotion recognition experiments on the IEMOCAP corpus validate the effectiveness of the proposed techniques both at the viseme and utterance levels. The accuracies of viseme level and utterance level emotion recognitions increase by 2.73% (5.9% relative) and 5.82% (11 % relative), respectively, over a lexicon-independent baseline. These performances represent statistically significant improvements.
  • Keywords
    compensation; emotion recognition; face recognition; feature extraction; image classification; IEMOCAP corpus; continuous frame-by-frame analysis; facial emotion recognition; facial expression; facial feature; feature level compensation; lexical content; lexical variability; lexicon-dependent pattern; lexicon-independent baseline; model level compensation; modulation; spontaneous human interaction; trajectory-based modeling; utterance level emotion recognition; verbal content; viseme level; viseme-dependent emotional classifier; whitening transformation; DH-HEMTs; Emotion recognition; Erbium; Mouth;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Face and Gesture Recognition (FG), 2013 10th IEEE International Conference and Workshops on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4673-5545-2
  • Electronic_ISBN
    978-1-4673-5544-5
  • Type

    conf

  • DOI
    10.1109/FG.2013.6553752
  • Filename
    6553752