DocumentCode :
2527386
Title :
Real-time Emotion Detection System using Speech: Multi-modal Fusion of Different Timescale Features
Author :
Kim, Samuel ; Georgiou, Panayiotis G. ; Lee, Sungbok ; Narayanan, Shrikanth
Author_Institution :
Southern California Univ., Los Angeles
fYear :
2007
fDate :
1-3 Oct. 2007
Firstpage :
48
Lastpage :
51
Abstract :
The goal of this work is to build a real-time emotion detection system which utilizes multi-modal fusion of different timescale features of speech. Conventional spectral and prosody features are used for intra-frame and supra-frame features respectively, and a new information fusion algorithm which takes care of the characteristics of each machine learning algorithm is introduced. In this framework, the proposed system can be associated with additional features, such as lexical or discourse information, in later steps. To verify the realtime system performance, binary decision tasks on angry and neutral emotion are performed using concatenated speech signal simulating realtime conditions.
Keywords :
emotion recognition; learning (artificial intelligence); sensor fusion; speech processing; speech recognition; binary decision tasks; emotion detection system; information fusion algorithm; intra-frame features; machine learning algorithm; multimodal fusion; prosody features; spectral features; speech; supra-frame features; Automatic speech recognition; Data mining; Emotion recognition; Feature extraction; Loudspeakers; Machine learning algorithms; Mel frequency cepstral coefficient; Real time systems; Speech analysis; Speech processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia Signal Processing, 2007. MMSP 2007. IEEE 9th Workshop on
Conference_Location :
Crete
Print_ISBN :
978-1-4244-1274-7
Type :
conf
DOI :
10.1109/MMSP.2007.4412815
Filename :
4412815
Link To Document :
بازگشت