Title :
Emotion recognition from speech signal using mel-frequency cepstral coefficients
Author :
Onur Erdem Korkmaz;Ayten Atasoy
Author_Institution :
Ataturk University, Department of İspir Hamza Polat Vocational College, Erzurum, Turkey
Abstract :
In this paper, mel-frequency cepstral coefficients (MFCCs) are investigated as features for the emotional content of speech signals. The features are extracted from spoken utterances. During extraction, the speech signal is divided into short frames, and each frame overlaps part of the previous frame. The purpose of this overlap is to provide a smooth transition from one frame to the next and to prevent information loss at the end of each frame. The frame length and the scroll time (the shift between successive frames) are important in emotion recognition applications. We therefore investigated the effects of different frame lengths and scroll times on the classification success for four emotions: happy, angry, neutral, and sad. These emotions were classified using Support Vector Machine and k-Nearest Neighbors algorithms. Classification success was determined using 10-fold cross-validation, and the maximum success rate obtained was 98.7%.
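The overlapping framing step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name frame_signal, the 16 kHz sample rate, and the 25 ms frame length and 10 ms scroll time are assumptions chosen only for illustration, since the abstract does not state the values that were tested.

```python
import numpy as np

def frame_signal(signal, sample_rate, frame_ms=25.0, shift_ms=10.0):
    """Split a 1-D signal into overlapping frames (one frame per row).

    frame_ms and shift_ms are illustrative defaults, not values from the paper.
    """
    signal = np.asarray(signal, dtype=float)
    frame_len = int(round(sample_rate * frame_ms / 1000.0))
    shift = int(round(sample_rate * shift_ms / 1000.0))
    if frame_len <= 0 or shift <= 0:
        raise ValueError("frame and shift lengths must be positive")
    if len(signal) < frame_len:
        # Zero-pad so that a short signal still yields one full frame.
        signal = np.pad(signal, (0, frame_len - len(signal)))
    n_frames = 1 + (len(signal) - frame_len) // shift
    # Row i holds samples [i * shift, i * shift + frame_len).
    idx = np.arange(frame_len)[None, :] + shift * np.arange(n_frames)[:, None]
    return signal[idx]

# One second of a dummy ramp signal at an assumed 16 kHz sample rate.
frames = frame_signal(np.arange(16000), sample_rate=16000)
print(frames.shape)
```

With these assumed settings each frame is 400 samples long and advances by 160 samples, so consecutive frames share 240 samples. Trailing samples that do not fill a complete frame are dropped in this sketch; padding the last frame is an equally common choice.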
Keywords :
"Speech","Support vector machines","Speech recognition","Emotion recognition","Feature extraction","Databases","Cepstral analysis"
Conference_Title :
Electrical and Electronics Engineering (ELECO), 2015 9th International Conference on
DOI :
10.1109/ELECO.2015.7394435