Title :
Detection of emotionally significant regions of speech for emotion recognition
Author :
Hari Krishna Vydana;Peddakota Vikash;Tallam Vamsi;Kolla Pavan Kumar;Anil Kumar Vuppala
Author_Institution :
Speech and Vision Lab, IIIT Hyderabad, India
Abstract :
Emotions in human speech are short lived. In an emotive utterance, the emotive gestures produced due to the emotive state of the speaker persists only to a shorter duration. In this study, the regions of an utterance that are highly influenced by the emotive state of the speaker are detected. These regions are labeled as emotionally significant regions. Data from the detected emotionally significant regions is used for developing an emotion recognition system. Physiological constraints of human speech production system are explored for detecting the emotionally significant regions of an utterance. Spectral features from the detected emotionally significant regions are used to develop an emotion recognition system. A significant improvement in the performance of the emotion recognition system is observed in the present approach. An average improvement of 11% is in noted owing to the use of data from emotionally significant regions while developing an emotion recognition system. Gaussian mixture modelling (GMM) technique is employed to develop an emotion recognition system. During the present study, speech samples from Berlin emotion speech database (EMO-DB) are used. Four basic emotions such as anger, happy, neutral and fear are considered for study.
Keywords :
"Speech","Erbium","Emotion recognition","Databases","Speech recognition","Feature extraction","Mel frequency cepstral coefficient"
Conference_Titel :
India Conference (INDICON), 2015 Annual IEEE
Electronic_ISBN :
2325-9418
DOI :
10.1109/INDICON.2015.7443415