DocumentCode :
2931404
Title :
Recognizing human emotion from audiovisual information
Author :
Wang, Yongjin ; Guan, Ling
Author_Institution :
Dept. of Electr. & Comput. Eng., Toronto Univ., Ont., Canada
Volume :
2
fYear :
2005
fDate :
18-23 March 2005
Abstract :
In this paper, we present an emotion recognition system to classify human emotional state from audiovisual signals. We extract prosodic, mel-frequency cepstral coefficient (MFCC), and formant frequency features to represent the audio characteristics of the emotional speech. A face detection scheme, based on the HSV color model, is used to detect the face from the background. The facial expressions are represented by Gabor wavelet features. We perform feature selection by using a stepwise method based on Mahalanobis distance. A classification scheme involving the analysis of individual class and combinations of different classes is proposed. Our emotion recognition system is tested over a language and race independent database, and an overall recognition accuracy of 82.14% is achieved.
Keywords :
audio signal processing; cepstral analysis; emotion recognition; face recognition; feature extraction; pattern classification; wavelet transforms; Gabor wavelet features; HSV color model; Mahalanobis distance; audio feature extraction; audiovisual information; emotional state classification; face detection; formant frequency features; human emotion recognition; mel-frequency cepstral coefficients; recognition accuracy; stepwise feature selection method; visual feature extraction; Cultural differences; Emotion recognition; Face detection; Hair; Humans; Mel frequency cepstral coefficient; Natural languages; Spatial databases; Speech; System testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8874-7
Type :
conf
DOI :
10.1109/ICASSP.2005.1415607
Filename :
1415607
Link To Document :
بازگشت