DocumentCode :
1347425
Title :
Error Weighted Semi-Coupled Hidden Markov Model for Audio-Visual Emotion Recognition
Author :
Lin, Jen-Chun ; Wu, Chung-Hsien ; Wei, Wen-Li
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Volume :
14
Issue :
1
Year :
2012
Firstpage :
142
Lastpage :
156
Abstract :
This paper presents an approach to the automatic recognition of human emotions from audio-visual bimodal signals using an error weighted semi-coupled hidden Markov model (EWSC-HMM). The proposed approach combines an SC-HMM with a state-based bimodal alignment strategy and a Bayesian classifier weighting scheme to obtain the optimal emotion recognition result based on audio-visual bimodal fusion. The state-based bimodal alignment strategy in SC-HMM is proposed to align the temporal relation between audio and visual streams. The Bayesian classifier weighting scheme is then adopted to explore the contributions of the SC-HMM-based classifiers for different audio-visual feature pairs in order to obtain the emotion recognition output. For performance evaluation, two databases are considered: the MHMC posed database and the SEMAINE naturalistic database. Experimental results show that the proposed approach not only outperforms other fusion-based bimodal emotion recognition methods for posed expressions but also provides satisfactory results for naturalistic expressions.
Keywords :
Bayes methods; audio streaming; audio-visual systems; emotion recognition; feature extraction; hidden Markov models; image classification; image fusion; video streaming; visual databases; Bayesian classifier weighting scheme; EWSC-HMM; SC-HMM based classifier; SEMAINE naturalistic database; audio stream; audio visual bimodal fusion; audio visual feature pair; automatic recognition; error weighted semicoupled hidden Markov model; optimal human emotion recognition; performance evaluation; state based bimodal alignment strategy; temporal relation; visual stream; Correlation; Databases; Emotion recognition; Hidden Markov models; Humans; Speech; Visualization; Audio-visual bimodal fusion; emotion recognition; semi-coupled hidden Markov model (SC-HMM);
Language :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
IEEE
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2011.2171334
Filename :
6042338