Title :
Automatic Speech Emotion Recognition using Support Vector Machine
Author :
Peipei Shen ; Zhou Changjun ; Xiong Chen
Author_Institution :
Dept. of Comput. Technol., Shanghai Jiao Tong Univ., Shanghai, China
Abstract :
Automatic Speech Emotion Recognition (SER) is a current research topic in the field of Human Computer Interaction (HCI) with wide range of applications. The purpose of speech emotion recognition system is to automatically classify speaker´s utterances into five emotional states such as disgust, boredom, sadness, neutral, and happiness. The speech samples are from Berlin emotional database and the features extracted from these utterances are energy, pitch, linear prediction cepstrum coefficients (LPCC), Mel Frequency cepstrum coefficients (MFCC), Linear Prediction coefficients and Mel cepstrum coefficients (LPCMCC). The Support Vector Machine (SVM) is used as a classifier to classify different emotional states. The system gives 66.02% classification accuracy for only using energy and pitch features, 70.7% for only using LPCMCC features, and 82.5% for using both of them.
Keywords :
emotion recognition; human computer interaction; speech recognition; support vector machines; Berlin emotional database; HCI; Mel frequency cepstrum coefficients; SVM; automatic speech emotion recognition; boredom state; disgust state; emotional states; energy features; happiness state; human computer interaction; linear prediction cepstrum coefficients; neutral state; pitch features; sadness state; speaker utterances; support vector machine; Cepstrum; Emotion recognition; Feature extraction; Mel frequency cepstral coefficient; Speech; Speech recognition; Support vector machines; Automatic Emotion Recognition; Energy; LPCC; LPCMCC; MFCC; Pitch; SVM; Speech Emotion;
Conference_Titel :
Electronic and Mechanical Engineering and Information Technology (EMEIT), 2011 International Conference on
Conference_Location :
Harbin, Heilongjiang, China
Print_ISBN :
978-1-61284-087-1
DOI :
10.1109/EMEIT.2011.6023178