DocumentCode
2801577
Title
Speech Emotion Recognition Using Segmental Level Prosodic Analysis
Author
Koolagudi, Shashidhar G. ; Kumar, Nitin ; Rao, K. Sreenivasa
Author_Institution
Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
fYear
2011
fDate
24-25 Feb. 2011
Firstpage
1
Lastpage
5
Abstract
In this paper, prosodic analysis of speech segments is performed to recognise emotions. Speech signal is segmented into words and syllables. Energy and pitch parameters are extracted from utterances, words and syllables separately to develop emotion recognition models. Eight emotions (anger, disgust, fear, happy, neutral, sad, sarcastic and surprise) of simulated emotion speech corpus, IITKGP SESC [1] are used in this work for recognition of emotions. Word boundaries are manually marked for 15 utterances of IITKGP-SESC. Syllable boundaries are detected using vowel onset points (VOPs) as anchor locations. Recognition performance of emotions using segmental level prosodic features is not found to be appreciable, but by combining spectral features along with prosodic features, emotion recognition performance is considerably improved. Support vector machines (SVM) and Gaussian mixture models (GMM) are used to develop emotion models to analyse different speech segments for emotion recognition.
Keywords
emotion recognition; speech recognition; support vector machines; Gaussian mixture model; IITKGP-SESC; SVM; segmental level prosodic analysis; speech emotion recognition; speech segmentation; support vector machine; vowel onset point; Databases; Emotion recognition; Feature extraction; Speech; Speech processing; Speech recognition; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Devices and Communications (ICDeCom), 2011 International Conference on
Conference_Location
Mesra
Print_ISBN
978-1-4244-9189-6
Type
conf
DOI
10.1109/ICDECOM.2011.5738536
Filename
5738536
Link To Document