Title :
Emotions in speech - experiments with prosody and quality features in speech for use in categorical and dimensional emotion recognition environments
Author :
Borchert, Martin ; Dusterhoft, A.
Author_Institution :
Dept. of Electr. Eng., Univ. of Bus., Technol. & Design, Wismar, Germany
fDate :
30 Oct.-1 Nov. 2005
Abstract :
This paper focuses on features in speech and classification algorithms for using them in emotion recognition software. We illustrate a new approach concentrated on analyzing speech quality features. The quality features are formants, spectral energy distribution in different frequency bands, harmonics-to-noise ratio (in different frequency bands) and irregularities (jitter, shimmer). Some papers (A. Dusterhoft et al., 2003, D. Goleman, 1995) show that there is a relationship between quality features and the valence axis. This paper deals with dimensional approach to classify emotions. Therefore, mainly quality features are taken for the valence axis to classify emotions and mainly prosody features are taken for the arousal axis. Because our experiments show that single emotion recognition rates are up to 90 percent and the recognition rates for speaker independent recognition is about 70% for all classification algorithm, it seems that quality features are more appropriate to differentiate emotions with the same arousal and different valence levels in a dimensional approach. A prototypical emotion recognition software is implemented which is actually tested for analyzing the mood of customers in call centers.
Keywords :
call centres; emotion recognition; signal classification; speech recognition; call centers; categorical emotion recognition; dimensional emotion recognition software; emotion classification algorithms; harmonics-to-noise ratio; prosody features; speaker independent recognition; spectral energy distribution; speech quality features; Algorithm design and analysis; Classification algorithms; Emotion recognition; Frequency; Java; Mood; Software algorithms; Software prototyping; Software quality; Speech analysis;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
DOI :
10.1109/NLPKE.2005.1598724