DocumentCode
3196207
Title
Vowels formants analysis allows straightforward detection of high arousal emotions
Author
Vlasenko, Bogdan ; Philippou-Hübner, David ; Prylipko, Dmytro ; Böck, Ronald ; Siegert, Ingo ; Wendemuth, Andreas
Author_Institution
Cognitive Systems, IESK, Otto-von-Guericke Universität, Magdeburg, Germany
fYear
2011
fDate
11-15 July 2011
Firstpage
1
Lastpage
6
Abstract
Recently, automatic emotion recognition from speech has attracted growing interest within the human-machine interaction research community. Most emotion recognition methods use context-independent frame-level or turn-level analysis. In this article, we introduce context-dependent vowel-level analysis for emotion classification. The average first formant value extracted at the vowel level is used as a one-dimensional acoustic feature, and the Neyman-Pearson criterion is used for classification. Our classifier is able to detect high-arousal emotions with small error rates. In our research we show that the smallest emotional unit should be the vowel rather than the word. We find that vowel-level analysis can be an important factor in developing a robust emotion classifier. Our research can also be useful for developing robust affective speech recognition methods and high-quality emotional speech synthesis systems.
Keywords
affective speech; emotion detection; formant analysis
fLanguage
English
Publisher
IEEE
Conference_Titel
Multimedia and Expo (ICME), 2011 IEEE International Conference on
Conference_Location
Barcelona, Spain
ISSN
1945-7871
Print_ISBN
978-1-61284-348-3
Electronic_ISBN
1945-7871
Type
conf
DOI
10.1109/ICME.2011.6012003
Filename
6012003