Title :
A dimensional approach to emotion recognition of speech from movies
Author :
Giannakopoulos, Theodoros ; Pikrakis, Aggelos ; Theodoridis, Sergios
Author_Institution :
Dept. of Inf. & Telecommun., Univ. of Athens, Athens
Abstract :
In this paper we present a novel method for extracting affective information from movies, based on speech data. The method is based on a 2D representation of speech emotions (Emotion Wheel). The goal is twofold. First, to investigate whether the Emotion Wheel offers a good representation for emotions associated with speech signals. To this end, several humans have manually annotated speech data from movies using the Emotion Wheel and the level of disagreement has been computed as a measure of representation quality. The results indicate that the emotion wheel is a good representation of emotions in speech data. Second, a regression approach is adopted, in order to predict the location of an unknown speech segment in the Emotion Wheel. Each speech segment is represented by a vector of ten audio features. The results indicate that the resulting architecture can estimate emotion states of speech from movies, with sufficient accuracy.
Keywords :
emotion recognition; feature extraction; prediction theory; regression analysis; signal representation; speech recognition; audio feature; emotion wheel; movie information extraction; multimedia analysis; regression approach; signal representation; speech emotion recognition; Content based retrieval; Data mining; Emotion recognition; Humans; Informatics; Motion pictures; Music information retrieval; Psychology; Speech; Wheels; Emotion Recognition; Multimedia analysis; Regression;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4959521