Speech emotion recognition based on dynamic models

Author

Guoyun Lv ; Shuixian Hu ; Xipan Lu

Author_Institution

Sch. of Electron. & Inf., Northwestern Polytech. Univ., Xi´an, China

fYear

2014

fDate

7-9 July 2014

Firstpage

480

Lastpage

484

Abstract

This paper introduced the semi-continuous Hidden Markov Model (HMM) and proposed a novel Dynamic Bayesian Network (DBN) model for dynamic speech emotion recognition. The former reduces the training complexity caused by mixture Gaussians by sharing the Condition Probability Densities (CPDs) of Gaussians among the states, and the latter adds a sub-state layer between state and observation layer based on traditional DBN framework and describes the dynamic process of speech emotion in detail. Experiments results show that average emotion recognition rate of semi-continuous HMM is 4% and 10% higher than those of classical HMM and Mixture Gaussian HMM respectively, and average emotion recognition rate of the three-layer DBN model is 11% and 8% higher than those of traditional DBN model and semi-continuous HMM.

Keywords

Bayes methods; Gaussian processes; emotion recognition; hidden Markov models; speech recognition; Gaussian CPDs; Gaussian condition probability densities; dynamic Bayesian network model; dynamic speech emotion recognition; semicontinuous HMM; semicontinuous hidden Markov model; three-layer DBN model; Emotion recognition; Feature extraction; Hidden Markov models; Speech; Speech processing; Speech recognition; Support vector machines; dynamic bayesian network; dynamic model; emotion recognition; hidden markov model;

fLanguage

English

Publisher

ieee

Conference_Titel

Audio, Language and Image Processing (ICALIP), 2014 International Conference on

Conference_Location

Shanghai

Print_ISBN

978-1-4799-3902-2

Type

conf

DOI

10.1109/ICALIP.2014.7009840

Filename

7009840