• DocumentCode
    635405
  • Title

    Multimodal information fusion of audiovisual emotion recognition using novel information theoretic tools

  • Author

    Zhibing Xie ; Ling Guan

  • Author_Institution
    Ryerson Multimedia Res. Lab., Ryerson Univ., Toronto, ON, Canada
  • fYear
    2013
  • fDate
    15-19 July 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper aims at providing general theoretical analysis for the issue of multimodal information fusion and implementing novel information theoretic tools in multimedia application. The most essential issues for information fusion include feature transformation and reduction of feature dimensionality. Most previous solutions are based on the second order statistics, which is only optimal for Gaussian-like distribution, while in this paper we describe kernel entropy component analysis (KECA) which utilizes descriptor of information entropy and achieves improved performance by entropy estimation. We present a new solution based on the integration of information fusion theory and information theoretic tools in this paper. The proposed method has been applied to audiovisual emotion recognition. Information fusion has been implemented for audio and video channels at feature level and decision level. Experimental results demonstrate that the proposed algorithm achieves improved performance in comparison with the existing methods, especially when the dimension of feature space is substantially reduced.
  • Keywords
    audio-visual systems; emotion recognition; entropy; higher order statistics; sensor fusion; statistical distributions; video signal processing; Gaussian-like distribution; KECA; audio channels; audiovisual emotion recognition; decision level implementation; entropy estimation; feature dimensionality reduction; feature level implementation; feature space reduction; feature transformation; information entropy; information fusion theory; information theoretic tools; kernel entropy component analysis; multimedia application; multimodal information fusion; second order statistics; video channels; Databases; Emotion recognition; Entropy; Feature extraction; Hidden Markov models; Kernel; Vectors; Multimodal information fusion; audiovisual emotion recognition; kernel entropy component analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo (ICME), 2013 IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • ISSN
    1945-7871
  • Type

    conf

  • DOI
    10.1109/ICME.2013.6607464
  • Filename
    6607464