مرکز منطقه ای اطلاع رساني علوم و فناوري - Speech emotion recognition with i-vector feature and RNN model

DocumentCode :

735056

Title :

Speech emotion recognition with i-vector feature and RNN model

Author :

Teng Zhang ; Ji Wu

Author_Institution :

Dept. of Electron. Eng., Tsinghua Univ., Beijing, China

fYear :

2015

fDate :

12-15 July 2015

Firstpage :

524

Lastpage :

528

Abstract :

Machine-based emotion recognition from speech has emerged as an important research area in recent years. However, most studies have been done on artificial data. The difficulty of the recognition task increases when we facing natural speech data such as real-world conversations from call centre. Along with that difficulty, there are some new properties which may be useful to the real-world recognition tasks. In this paper, we focus on the recognition task on real-world conversations. Traditional prosodic acoustic features and the novel i-vector features are introduced and compared to represent the speech signal more abstractly. We also propose a Recurrent Neural Network approach to map the features to emotion labels. With only prosodic acoustic features and SVM multi-clasifier, we obtain a f-measure of 38.3%. By adding the i-vector features and the RNN model, we achieve a better result of 48.9%.

Keywords :

acoustic signal processing; emotion recognition; feature extraction; recurrent neural nets; signal classification; speech recognition; statistical analysis; support vector machines; RNN model; SVM multiclassifier; call centre; emotion labels; f-measure; i-vector feature; machine-based emotion recognition; natural speech data; prosodic acoustic features; real-world conversations; real-world recognition tasks; recurrent neural network; speech emotion recognition; speech signal representation; Acoustics; Databases; Emotion recognition; Feature extraction; Recurrent neural networks; Speech; Speech recognition; Emotion recognition; Recurrent neural networks; Speech analysis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on

Conference_Location :

Chengdu

Type :

conf

DOI :

10.1109/ChinaSIP.2015.7230458

Filename :

7230458

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=735056