• DocumentCode
    1798682
  • Title

    Speech emotion recognition based on dynamic models

  • Author

    Guoyun Lv ; Shuixian Hu ; Xipan Lu

  • Author_Institution
    Sch. of Electron. & Inf., Northwestern Polytech. Univ., Xi´an, China
  • fYear
    2014
  • fDate
    7-9 July 2014
  • Firstpage
    480
  • Lastpage
    484
  • Abstract
    This paper introduced the semi-continuous Hidden Markov Model (HMM) and proposed a novel Dynamic Bayesian Network (DBN) model for dynamic speech emotion recognition. The former reduces the training complexity caused by mixture Gaussians by sharing the Condition Probability Densities (CPDs) of Gaussians among the states, and the latter adds a sub-state layer between state and observation layer based on traditional DBN framework and describes the dynamic process of speech emotion in detail. Experiments results show that average emotion recognition rate of semi-continuous HMM is 4% and 10% higher than those of classical HMM and Mixture Gaussian HMM respectively, and average emotion recognition rate of the three-layer DBN model is 11% and 8% higher than those of traditional DBN model and semi-continuous HMM.
  • Keywords
    Bayes methods; Gaussian processes; emotion recognition; hidden Markov models; speech recognition; Gaussian CPDs; Gaussian condition probability densities; dynamic Bayesian network model; dynamic speech emotion recognition; semicontinuous HMM; semicontinuous hidden Markov model; three-layer DBN model; Emotion recognition; Feature extraction; Hidden Markov models; Speech; Speech processing; Speech recognition; Support vector machines; dynamic bayesian network; dynamic model; emotion recognition; hidden markov model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Audio, Language and Image Processing (ICALIP), 2014 International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4799-3902-2
  • Type

    conf

  • DOI
    10.1109/ICALIP.2014.7009840
  • Filename
    7009840