Regional Information Center for Science and Technology (RICEST) - Error Weighted Semi-Coupled Hidden Markov Model for Audio-Visual Emotion Recognition

DocumentCode :

1347425

Title :

Error Weighted Semi-Coupled Hidden Markov Model for Audio-Visual Emotion Recognition

Author :

Lin, Jen-Chun ; Wu, Chung-Hsien ; Wei, Wen-Li

Author_Institution :

Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan

Volume :

Issue :

Year :

2012

Firstpage :

142

Lastpage :

156

Abstract :

This paper presents an approach to the automatic recognition of human emotions from audio-visual bimodal signals using an error weighted semi-coupled hidden Markov model (EWSC-HMM). The proposed approach combines an SC-HMM with a state-based bimodal alignment strategy and a Bayesian classifier weighting scheme to obtain the optimal emotion recognition result based on audio-visual bimodal fusion. The state-based bimodal alignment strategy in SC-HMM is proposed to align the temporal relation between audio and visual streams. The Bayesian classifier weighting scheme is then adopted to explore the contributions of the SC-HMM-based classifiers for different audio-visual feature pairs in order to obtain the emotion recognition output. For performance evaluation, two databases are considered: the MHMC posed database and the SEMAINE naturalistic database. Experimental results show that the proposed approach not only outperforms other fusion-based bimodal emotion recognition methods for posed expressions but also provides satisfactory results for naturalistic expressions.
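
The classifier weighting step described in the abstract can be illustrated roughly as follows. This is a minimal sketch, not the paper's formulation: the abstract gives no equations, so the weight definition (per-class precision taken from a validation confusion matrix), the function names `error_weights` and `fuse`, and the toy numbers are all assumptions made for illustration. The SC-HMM itself and the state-based alignment are not modelled; each classifier is represented only by the class scores it outputs.

```python
from typing import List


def error_weights(confusion: List[List[int]]) -> List[float]:
    """Assumed weighting: for each class c, the fraction of validation samples
    predicted as c that truly were c (rows = true class, cols = predicted)."""
    n = len(confusion)
    weights = []
    for c in range(n):
        predicted_c = sum(confusion[t][c] for t in range(n))
        weights.append(confusion[c][c] / predicted_c if predicted_c else 0.0)
    return weights


def fuse(scores: List[List[float]], weights: List[List[float]]) -> List[float]:
    """Weighted sum of per-classifier class scores, normalised to sum to 1.
    Scores are assumed non-negative with a positive sum per classifier."""
    n_classes = len(scores[0])
    fused = [0.0] * n_classes
    for s, w in zip(scores, weights):
        total = sum(s)
        for c in range(n_classes):
            fused[c] += w[c] * s[c] / total
    z = sum(fused)
    return [f / z for f in fused]


# Toy example: two hypothetical classifiers (one per audio-visual feature pair),
# two emotion classes. Classifier A is more reliable on validation data than B.
weights_a = error_weights([[9, 1], [1, 9]])
weights_b = error_weights([[6, 4], [4, 6]])
fused = fuse([[0.8, 0.2], [0.3, 0.7]], [weights_a, weights_b])
best_class = max(range(len(fused)), key=fused.__getitem__)
# Class 0 gets the larger fused score, because the more reliable
# classifier A favours it.
```

The point of the sketch is only that a classifier with fewer validation errors contributes more to the fused decision; how the published method actually derives and applies its weights has to be taken from the paper itself.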

Keywords :

Bayes methods; audio streaming; audio-visual systems; emotion recognition; feature extraction; hidden Markov models; image classification; image fusion; video streaming; visual databases; Bayesian classifier weighting scheme; EWSC-HMM; SC-HMM based classifier; SEMAINE naturalistic database; audio stream; audio visual bimodal fusion; audio visual feature pair; automatic recognition; error weighted semicoupled hidden Markov model; optimal human emotion recognition; performance evaluation; state based bimodal alignment strategy; temporal relation; visual stream; Correlation; Databases; Emotion recognition; Hidden Markov models; Humans; Speech; Visualization; Audio-visual bimodal fusion; emotion recognition; semi-coupled hidden Markov model (SC-HMM);

Language :

English

Journal_Title :

Multimedia, IEEE Transactions on

Publisher :

IEEE

ISSN :

1520-9210

Type :

Journal article

DOI :

10.1109/TMM.2011.2171334

Filename :

6042338

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1347425