DocumentCode :
2238984
Title :
Speaker Independent Speech Emotion Recognition by Ensemble Classification
Author :
Schuller, Björn ; Reiter, Stephan ; Müller, Ronald ; Al-Hames, Marc ; Lang, Manfred ; Rigoll, Gerhard
Author_Institution :
Inst. for Human-Machine Commun., Technische Univ. Munchen
fYear :
2005
fDate :
6-6 July 2005
Firstpage :
864
Lastpage :
867
Abstract :
Emotion recognition grows to an important factor in future media retrieval and man machine interfaces. However, even human deciders often experience problems realizing one´s emotion, especially of strangers. In this work we strive to recognize emotion independent of the person concentrating on the speech channel. Single feature relevance of acoustic features is a critical point, which we address by filter-based gain ratio calculation starting at a basis of 276 features. As optimization of a minimum set as a whole in general saves more extraction effort, we furthermore apply an SVM-SFFS wrapper based search. For a more robust estimation we also integrate spoken content information by a Bayesian net analysis of ASR outputs. Overall classification is realized in an early feature fusion by stacked ensembles of diverse base classifiers. Tests ran on a 3,947 movie and automotive interaction dialog-turns database consisting of 35 speakers. Remarkable overall performance can be reported in the discrimination of the seven discrete emotions named in the MPEG-4 standard with added neutrality
Keywords :
belief networks; data compression; emotion recognition; optimisation; pattern classification; speech recognition; support vector machines; telecommunication channels; ASR output; Bayesian net analysis; MPEG-4 standard; SVM-SFFS wrapper search; acoustic feature selection; automatic speech recognition; automotive interaction dialog-turns database; ensemble classification; optimization; sequential forward floating search; speaker independent speech emotion recognition; speech channel; spoken content information; support vector machine; Automatic speech recognition; Bayesian methods; Data mining; Emotion recognition; Humans; Information analysis; Man machine systems; Robustness; Speech recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on
Conference_Location :
Amsterdam
Print_ISBN :
0-7803-9331-7
Type :
conf
DOI :
10.1109/ICME.2005.1521560
Filename :
1521560
Link To Document :
بازگشت