DocumentCode
2485377
Title
A Hybrid PNN-GMM classification scheme for speech emotion recognition
Author
Ser, Wee ; Cen, Ling ; Yu, Zhu Liang
Author_Institution
Centre for Signal Process., Nanyang Technol. Univ., Singapore
fYear
2008
fDate
8-11 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
With the increasing demand for spoken language interfaces in human-computer interactions, automatic recognition of emotional states from human speeches has become of increasing importance. In this paper, we propose a novel hybrid scheme that combines the probabilistic neural network (PNN) and the Gaussian mixture model (GMM) for identifying emotions from speech signals. In order to handle mismatches more effectively, the universal background model (UBM) is incorporated into the GMM, and the resultant model is denoted as UBM-GMM. In the hybrid scheme, the strengths of the PNN and the UBM-GMM are combined through a novel conditional-probability based fusion algorithm. Experimental results show that the proposed scheme is able to achieve higher recognition accuracy than that obtained by using PNN or UBM-GMM alone.
Keywords
Gaussian processes; emotion recognition; human computer interaction; neural nets; pattern classification; probability; speech recognition; Gaussian mixture model; automatic recognition; conditional-probability based fusion algorithm; human-computer interaction; hybrid PNN-GMM classification scheme; probabilistic neural network; speech emotion recognition; spoken language interface; universal background model; Automatic speech recognition; Emotion recognition; Feature extraction; Humans; Neural networks; Signal processing; Speech processing; Speech recognition; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location
Tampa, FL
ISSN
1051-4651
Print_ISBN
978-1-4244-2174-9
Electronic_ISBN
1051-4651
Type
conf
DOI
10.1109/ICPR.2008.4761619
Filename
4761619
Link To Document