مرکز منطقه ای اطلاع رساني علوم و فناوري - Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture

DocumentCode :

3327951

Title :

Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture

Author :

Schuller, Björn ; Rigoll, Gerhard ; Lang, Manfred

Author_Institution :

Inst. for Human-Comput. Commun., Technische Univ. Munchen, Germany

Volume :

fYear :

2004

fDate :

17-21 May 2004

Abstract :

In this paper we introduce a novel approach to the combination of acoustic features and language information for a most robust automatic recognition of a speaker´s emotion. Seven discrete emotional states are classified throughout the work. Firstly a model for the recognition of emotion by acoustic features is presented. The derived features of the signal-, pitch-, energy, and spectral contours are ranked by their quantitative contribution to the estimation of an emotion. Several different classification methods including linear classifiers, Gaussian mixture models, neural nets, and support vector machines are compared by their performance within this task. Secondly an approach to emotion recognition by the spoken content is introduced applying belief network based spotting for emotional key-phrases. Finally the two information sources are integrated in a soft decision fusion by using a neural net. The gain is evaluated and compared to other advances. Two emotional speech corpora used for training and evaluation are described in detail and the results achieved applying the propagated novel advance to speaker emotion recognition are presented and discussed.

Keywords :

Gaussian distribution; belief networks; emotion recognition; feature extraction; frequency estimation; learning (artificial intelligence); linguistics; neural nets; pattern classification; spectral analysis; support vector machines; Gaussian mixture models; acoustic features; belief network based spotting; discrete emotional states; emotional key-phrases; emotional speech corpora; energy features; hybrid architecture; linear classifiers; linguistic information; neural nets; performance; pitch features; robust automatic recognition; signal features; soft decision fusion; spectral contours; speech emotion recognition; support vector machine; training; Emotion recognition; Information analysis; Information retrieval; Intelligent networks; Natural languages; Neural networks; Robustness; Speech analysis; Support vector machine classification; Support vector machines;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

0-7803-8484-9

Type :

conf

DOI :

10.1109/ICASSP.2004.1326051

Filename :

1326051

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3327951