DocumentCode :
1944457
Title :
Recognizing emotions for the audio-visual document indexing
Author :
LE, Xuan Hung ; Quénot, Georges ; Castelli, Eric
Author_Institution :
Laboratoire CLIPS-IMAG, Grenoble, France
Volume :
2
fYear :
2004
fDate :
28 June-1 July 2004
Firstpage :
580
Abstract :
In this paper, we proposed using MFCC coefficients (mel-scaled cepstral coefficients) and a simple but efficient classifying method: vector quantification (VQ) to perform speaker-dependent emotion recognition. Many other features: energy, pitch, zero crossing, phonetic rate, LPC... and their derivatives are also tested and combined with MFCC coefficients in order to find the best combination. Other models, GMM and HMM (discrete and continuous hidden Markov model), are studied as well in the hope that the use of continuous distribution and the temporal evolution of this set of features will improve the quality of emotion recognition. The accuracy recognizing five different emotions exceeds 80% by using only MFCC coefficients with VQ model. This is a simple but efficient approach, the result is even much better than those obtained with the same database in human evaluations by listening and judging without returning permission nor comparisons between sentences (Inger Samso Engberg and Anya Varnich Hansen, 2001).
Keywords :
Gaussian processes; audio databases; audio-visual systems; cepstral analysis; database indexing; emotion recognition; hidden Markov models; vector quantisation; MFCC coefficients; audio-visual document indexing; discrete and continuous hidden Markov model; human evaluations; mel-scaled cepstral coefficients; speaker-dependent emotion recognition; vector quantification; Cepstral analysis; Emotion recognition; Hidden Markov models; Humans; Indexing; Linear predictive coding; Mel frequency cepstral coefficient; Permission; Spatial databases; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computers and Communications, 2004. Proceedings. ISCC 2004. Ninth International Symposium on
Print_ISBN :
0-7803-8623-X
Type :
conf
DOI :
10.1109/ISCC.2004.1358600
Filename :
1358600
Link To Document :
بازگشت