DocumentCode
1798682
Title
Speech emotion recognition based on dynamic models
Author
Guoyun Lv ; Shuixian Hu ; Xipan Lu
Author_Institution
Sch. of Electron. & Inf., Northwestern Polytech. Univ., Xi´an, China
fYear
2014
fDate
7-9 July 2014
Firstpage
480
Lastpage
484
Abstract
This paper introduced the semi-continuous Hidden Markov Model (HMM) and proposed a novel Dynamic Bayesian Network (DBN) model for dynamic speech emotion recognition. The former reduces the training complexity caused by mixture Gaussians by sharing the Condition Probability Densities (CPDs) of Gaussians among the states, and the latter adds a sub-state layer between state and observation layer based on traditional DBN framework and describes the dynamic process of speech emotion in detail. Experiments results show that average emotion recognition rate of semi-continuous HMM is 4% and 10% higher than those of classical HMM and Mixture Gaussian HMM respectively, and average emotion recognition rate of the three-layer DBN model is 11% and 8% higher than those of traditional DBN model and semi-continuous HMM.
Keywords
Bayes methods; Gaussian processes; emotion recognition; hidden Markov models; speech recognition; Gaussian CPDs; Gaussian condition probability densities; dynamic Bayesian network model; dynamic speech emotion recognition; semicontinuous HMM; semicontinuous hidden Markov model; three-layer DBN model; Emotion recognition; Feature extraction; Hidden Markov models; Speech; Speech processing; Speech recognition; Support vector machines; dynamic bayesian network; dynamic model; emotion recognition; hidden markov model;
fLanguage
English
Publisher
ieee
Conference_Titel
Audio, Language and Image Processing (ICALIP), 2014 International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4799-3902-2
Type
conf
DOI
10.1109/ICALIP.2014.7009840
Filename
7009840
Link To Document