Multimodal information fusion of audiovisual emotion recognition using novel information theoretic tools

Author

Zhibing Xie ; Ling Guan

Author_Institution

Ryerson Multimedia Res. Lab., Ryerson Univ., Toronto, ON, Canada

fYear

2013

fDate

15-19 July 2013

Firstpage

1

Lastpage

6

Abstract

This paper aims at providing general theoretical analysis for the issue of multimodal information fusion and implementing novel information theoretic tools in multimedia application. The most essential issues for information fusion include feature transformation and reduction of feature dimensionality. Most previous solutions are based on the second order statistics, which is only optimal for Gaussian-like distribution, while in this paper we describe kernel entropy component analysis (KECA) which utilizes descriptor of information entropy and achieves improved performance by entropy estimation. We present a new solution based on the integration of information fusion theory and information theoretic tools in this paper. The proposed method has been applied to audiovisual emotion recognition. Information fusion has been implemented for audio and video channels at feature level and decision level. Experimental results demonstrate that the proposed algorithm achieves improved performance in comparison with the existing methods, especially when the dimension of feature space is substantially reduced.

Keywords

audio-visual systems; emotion recognition; entropy; higher order statistics; sensor fusion; statistical distributions; video signal processing; Gaussian-like distribution; KECA; audio channels; audiovisual emotion recognition; decision level implementation; entropy estimation; feature dimensionality reduction; feature level implementation; feature space reduction; feature transformation; information entropy; information fusion theory; information theoretic tools; kernel entropy component analysis; multimedia application; multimodal information fusion; second order statistics; video channels; Databases; Emotion recognition; Entropy; Feature extraction; Hidden Markov models; Kernel; Vectors; Multimodal information fusion; audiovisual emotion recognition; kernel entropy component analysis;

fLanguage

English

Publisher

ieee

Conference_Titel

Multimedia and Expo (ICME), 2013 IEEE International Conference on

Conference_Location

San Jose, CA

ISSN

1945-7871

Type

conf

DOI

10.1109/ICME.2013.6607464

Filename

6607464