Authors:
Park, Chang-wun ; Lee, Dong-Wook ; Sim, Kwee-Bo
Abstract:
Emotion recognition can be performed by various methods, mainly visual or aural. We show that it is possible to recognize a person's emotion from sound data alone, using the pitch of speech as the main feature. Through pitch analysis we define feature patterns for four emotions (normal, angry, laugh, surprise). Based on these feature patterns, we implement a simulator in VC++. The simulator is composed of three modules: 'generation of individuals', 'recurrent neural network (RNN)', and 'evaluation'. Using the result of the simulator's learning stage, we obtain recognition results on other speech data (i.e., data not used for learning). In detail, each module works as follows. First, the generation-of-individuals module uses a (1+100)-ES and a (1+1)-ES (i.e., random search); we compare the results of the two methods and select the better one. Second, the RNN is composed of 7 nodes: 1 input node, 2 hidden-layer nodes, and 4 output nodes. This structure was selected for the characteristics of sequentially inputted speech data. Third, the evaluation module is critical, since it determines the extraction speed and the quality of the result. We implement a simulator from these modules and, applying it to other speech data, observe the recognition results.
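The learning loop described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation (which was in VC++): a (1+λ)-ES mutates the weights of a tiny recurrent net with 1 input node, 2 hidden nodes, and 4 output nodes (matching the 7-node structure above), so that a pitch sequence is mapped to one of the four emotion classes. All function names, the toy fitness function, and the data format are illustrative assumptions.

```python
import math
import random

# 7-node structure from the abstract: 1 input, 2 hidden, 4 output.
N_IN, N_HID, N_OUT = 1, 2, 4

def init_weights(rng):
    # Flat weight vector: input->hidden, hidden->hidden (recurrent), hidden->output.
    n = N_IN * N_HID + N_HID * N_HID + N_HID * N_OUT
    return [rng.uniform(-1.0, 1.0) for _ in range(n)]

def forward(w, pitch_seq):
    """Run the recurrent net over a pitch sequence; return 4 emotion scores."""
    w_ih = w[:N_HID]
    w_hh = w[N_HID:N_HID + N_HID * N_HID]
    w_ho = w[N_HID + N_HID * N_HID:]
    h = [0.0] * N_HID
    for x in pitch_seq:  # one pitch value per time step (sequential input)
        h = [math.tanh(w_ih[j] * x +
                       sum(w_hh[i * N_HID + j] * h[i] for i in range(N_HID)))
             for j in range(N_HID)]
    return [sum(w_ho[i * N_OUT + k] * h[i] for i in range(N_HID))
            for k in range(N_OUT)]

def fitness(w, data):
    """Toy evaluation: negative misclassification count (0 is perfect)."""
    errors = 0
    for pitch_seq, label in data:
        out = forward(w, pitch_seq)
        if out.index(max(out)) != label:
            errors += 1
    return -errors

def es_1_plus_lambda(data, lam=100, sigma=0.3, generations=200, seed=0):
    """(1+lambda)-ES: the parent survives unless some mutant is at least as fit."""
    rng = random.Random(seed)
    parent = init_weights(rng)
    best = fitness(parent, data)
    for _ in range(generations):
        children = [[wi + rng.gauss(0.0, sigma) for wi in parent]
                    for _ in range(lam)]
        champ = max(children, key=lambda c: fitness(c, data))
        f = fitness(champ, data)
        if f >= best:
            parent, best = champ, f
        if best == 0:  # all training utterances classified correctly
            break
    return parent, best
```

With `lam=100` this corresponds to the (1+100)-ES; setting `lam=1` reduces it to the (1+1)-ES (essentially a random-mutation hill climber) that the abstract compares against.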
Keywords:
emotion recognition; feature extraction; learning (artificial intelligence); recurrent neural networks; speech recognition; angry; center-clipping; evaluation; feature pattern; generation of individuals; laugh; normal; pitch analysis; sound data; speech; surprise; data mining; neural networks; pattern recognition; robot sensing systems; speech analysis; telephony