DocumentCode :
3349273
Title :
Audio-visual based emotion recognition using tripled hidden Markov model
Author :
Song, Mingli ; Chen, Chun ; You, Mingyu
Author_Institution :
Coll. of Comput. Sci., Zhejiang Univ., Hangzhou, China
Volume :
5
fYear :
2004
fDate :
17-21 May 2004
Abstract :
Emotion recognition is one of the latest challenges in intelligent human/machine communication. Most of previous work on emotion recognition focused on extracting emotions from visual or audio information separately. A novel approach is presented in this paper to recognize the human emotion which uses both visual and audio from video clips. A tripled hidden Markov model is introduced to perform the recognition which allows the state asynchrony of the audio and visual observation sequences while preserving their natural correlation over time. The experimental results show that this approach outperforms only using visual or audio separately.
Keywords :
correlation methods; emotion recognition; face recognition; feature extraction; hidden Markov models; man-machine systems; maximum likelihood estimation; speech recognition; video signal processing; Viterbi algorithm; audio-visual based emotion recognition; audio/visual state asynchrony; audio/visual time correlation; human emotion; intelligent human/machine communication; tripled hidden Markov model; video clip audio; visual feature extraction; Emotion recognition; Facial features; Feature extraction; Hidden Markov models; Humans; MPEG 4 Standard; Mouth; Shape; Speech analysis; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1327251
Filename :
1327251
Link To Document :
بازگشت