Title :
Determining the Smallest Emotional Unit for Level of Arousal Classification
Author :
Vlasenko, Bogdan ; Wendemuth, Andreas
Author_Institution :
Center for Behavioral Brain Sci., Otto von Guericke Univ., Magdeburg, Germany
Abstract :
Most state-of-the-art emotion recognition methods are based on turn- and frame-level analysis independent from phonetic transcription. Currently "affective computing" community could not specify the smallest emotional standard unit which can be easily classified and determined by any "advanced" and "non-advanced" listener. It is known that, acoustic modeling on the smallest phonetic unit (phoneme) started a new era in automatic speech recognition: switch from speaker dependent isolated word recognition to speaker independent continuous speech recognition. In or current research we showed that phoneme can be used as as smallest unit for high and low arousal emotion classification task. We trained our classifications models on the VAM dataset material and evaluated them on speech samples from the DES dataset. For our experiments we employed two different emotion classification approaches: general (phonetic pattern independent) and phoneme-based (phonetic pattern dependent). Both classification approaches used MFFC features extracted on the frame level. Our experimental results impressively show that the proposed phoneme-based classification technique could increase emotion classification performance by about 9.68% absolute (15.98% relative). We showed that phoneme-level emotion models trained on "natural" emotions could provide impressive classification performance on dataset with acted affective content.
Keywords :
emotion recognition; feature extraction; pattern classification; DES dataset; MFFC features; VAM dataset material; arousal emotion classification task; arousal level classification; emotion recognition methods; feature extraction; natural emotions; phoneme-based classification technique; phoneme-level emotion models; phonetic pattern independent approaches; smallest emotional unit determination; speech samples; Acoustics; Emotion recognition; Hidden Markov models; Materials; Speech; Speech recognition; Training; DES; EMO-DB; Emotion recognition; cross-language; emotion perception; emotional unit; level of arousal;
Conference_Titel :
Affective Computing and Intelligent Interaction (ACII), 2013 Humaine Association Conference on
Conference_Location :
Geneva
DOI :
10.1109/ACII.2013.136