Face Expression Recognition by Cross Modal Data Association

Author

Tawari, Ashish ; Trivedi, Mohan Manubhai

Author_Institution

Comput. Vision & Robot. Res. Lab., Univ. of California, San Diego, La Jolla, CA, USA

Volume

15

Issue

7

fYear

2013

fDate

Nov. 2013

Firstpage

1543

Lastpage

1552

Abstract

We present a novel facial expression recognition framework based on audio-visual information analysis. We propose to model the correlation between the audio and visual modalities while allowing them to be treated as asynchronous streams. We also show that, by incorporating auditory information, our framework can improve recognition performance while significantly reducing computational cost, because redundant or insignificant frames are not processed. In particular, we design a single representative image for an image sequence as a weighted sum of registered face images, where the weights are derived from auditory features. We use a still-image-based technique for the expression recognition task; the framework, however, can be generalized to work with dynamic features as well. We performed experiments on the eNTERFACE'05 audio-visual emotional database, which contains six archetypal emotion classes: Happy, Sad, Surprise, Fear, Anger, and Disgust. We report one-to-one binary classification as well as multi-class classification performance, evaluated using both subject-dependent and subject-independent strategies. Furthermore, we compare our multi-class classification accuracies with those in previously published work that uses the same database. Our analyses show promising results.
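
The weighted-sum image representation described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the weights are derived from auditory features, so the sketch assumes one non-negative audio score per frame is already available and simply normalizes those scores to sum to one. The names `audio_weights` and `weighted_face_image` are hypothetical, and face registration is assumed to have been done beforehand.

```python
from typing import List, Sequence

# A grayscale image stored as rows of pixel intensities.
Image = List[List[float]]


def audio_weights(scores: Sequence[float]) -> List[float]:
    """Normalize non-negative per-frame audio scores so they sum to 1.

    Assumption: the scores come from some auditory feature; the abstract
    does not specify which one, so they are treated as given here.
    """
    if not scores:
        raise ValueError("need at least one score")
    if any(s < 0 for s in scores):
        raise ValueError("scores must be non-negative")
    total = sum(scores)
    if total == 0:
        # No audio evidence at all: fall back to a plain average.
        return [1.0 / len(scores)] * len(scores)
    return [s / total for s in scores]


def weighted_face_image(frames: Sequence[Image],
                        scores: Sequence[float]) -> Image:
    """Collapse registered face frames into one image by a weighted sum."""
    if len(frames) != len(scores):
        raise ValueError("one score per frame is required")
    weights = audio_weights(scores)
    rows, cols = len(frames[0]), len(frames[0][0])
    out = [[0.0] * cols for _ in range(rows)]
    for frame, w in zip(frames, weights):
        if w == 0.0:
            # Frames with zero weight contribute nothing and are skipped.
            continue
        for r in range(rows):
            for c in range(cols):
                out[r][c] += w * frame[r][c]
    return out


# Two tiny 1x2 "frames"; the second frame carries three times the weight.
frames = [[[0.0, 0.0]], [[10.0, 20.0]]]
combined = weighted_face_image(frames, [1.0, 3.0])
```

The resulting single image could then be passed to any still-image expression classifier. Skipping zero-weight frames mirrors, in a simplified way, the abstract's point about avoiding insignificant frame processing.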

Keywords

audio-visual systems; emotion recognition; face recognition; image classification; image representation; image sequences; archetypal emotion classes; asynchronous streams; audio-visual information analysis; auditory features; auditory information; cross modal data association; cross-modality data correlation; eNTERFACE'05 audio-visual emotional database; facial expression recognition framework; image sequence; multiclass classification accuracies; multiclass classification performances; one-to-one binary classification; Facial expression recognition; affect analysis; affective computing; audio-visual expression recognition; key frames selection; multi-modal expression recognition

fLanguage

English

Journal_Title

Multimedia, IEEE Transactions on

Publisher

IEEE

ISSN

1520-9210

Type

jour

DOI

10.1109/TMM.2013.2266635

Filename

6525327