Title :
Speech Emotion Recognition Research Based on the Stacked Generalization Ensemble Neural Network for Robot Pet
Author :
Huang, Yongming ; Zhang, Guobao ; Xu, Xiaoli
Author_Institution :
Sch. of Autom., Southeast Univ., Nanjing, China
Abstract :
In this paper, we present an emotion recognition system using the stacked generalization ensemble neural network for special human affective state in the speech signal. 450 short emotional sentences with different contents from 3 speakers were collected as experiment materials. The features relevant with energy, speech rate, pitch and formant are extracted from speech signals. Stacked generalization ensemble neural networks are used as the classifier for 5 emotions including anger, calmness, happiness, sadness and boredom. First, compared with the traditional BP network or wavelet neural network, the results of experiments show that the stacked generalization ensemble neural network has faster convergence speed and higher recognition rate. Second, after discussing the advantage and disadvantage between different ensemble neural networks, suitable decision will be made for robot pet.
Keywords :
emotion recognition; feature extraction; intelligent robots; neurocontrollers; signal classification; speech recognition; BP network; convergence speed; intelligent robot pet; special human affective state; speech emotion recognition system; speech signal extraction; stacked generalization ensemble neural network classifier; wavelet neural network; Emotion recognition; Filtering; Frequency; Humans; Linear predictive coding; Neural networks; Positron emission tomography; Robotics and automation; Robots; Speech processing;
Conference_Titel :
Pattern Recognition, 2009. CCPR 2009. Chinese Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-4199-0
DOI :
10.1109/CCPR.2009.5344020