DocumentCode :
1859725
Title :
Audio-Visual Emotion Recognition Using Neural Networks Learned with Hints
Author :
Kun Lu ; Xin Zhang
Author_Institution :
Sch. of Software, Beijing Inst. of Technol., Beijing, China
fYear :
2013
fDate :
26-28 July 2013
Firstpage :
515
Lastpage :
519
Abstract :
This paper presents a neural network (NN) based multimodal fusion classifier for automatic emotion recognition. The audio and visual channels provide complementary information, so we utilize features from three behavioral cues: frontal-view facial expression, profile-view facial expression and vocalization (audio). The problem of interest is to recognize basic emotions, and we use dimensional representation of emotion as the heuristic information (hints) when training NNs used for both single cue processing and multimodal fusion. With the aid of hints, the weights of NNs could learn optimized feature groupings and recognition accuracy of our classifier would be notably improved even when training data is insufficient. Experimental results on audio-visual emotion data recorded by ourselves in Wizard of Oz scenarios and emotion data from the Semaine naturalistic database both demonstrate that our approach is effective and promising.
Keywords :
audio signal processing; emotion recognition; face recognition; feature extraction; image classification; learning (artificial intelligence); neural nets; sensor fusion; speech recognition; NN training; Semaine naturalistic database; Wizard of Oz scenario; audio channel; audio-visual emotion recognition; automatic emotion recognition; behavioral cues; complementary information; emotion dimensional representation; frontal-view facial expression; heuristic information; hints; learning; multimodal fusion classifier; neural network; optimized feature grouping; profile-view facial expression; recognition accuracy; single cue processing; visual channel; vocalization; Artificial neural networks; Databases; Emotion recognition; Facial features; Feature extraction; Hidden Markov models; Visualization; Wizard of Oz scenario; audio-visual fusion; emotion recognition; learned with hints; neural network;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image and Graphics (ICIG), 2013 Seventh International Conference on
Conference_Location :
Qingdao
Type :
conf
DOI :
10.1109/ICIG.2013.109
Filename :
6643726
Link To Document :
بازگشت