DocumentCode :
2507021
Title :
Emotion Recognition in Spontaneous Speech within Work and Family Environments
Author :
Ling He ; Lech, Margaret ; Maddage, N. ; Memon, Saud A. ; Allen, Nathaniel
Author_Institution :
Sch. of Electr. & Comput. Eng., RMIT Univ., Melbourne, VIC, Australia
fYear :
2009
fDate :
11-13 June 2009
Firstpage :
1
Lastpage :
4
Abstract :
The speech signal is an important tool for conveying information between humans; at the same time, it is an indicator of a speaker´s emotions. In this paper, the automatic identification of affect from speech containing spontaneously expressed (not acted) emotions within different environments was investigated. The teager energy operator-perceptual wavelet packet (TEO-PWP) features as well as the mel frequency cepstral coefficients (MFCC) were used to model the emotions using two classifiers: the Gaussian mixture model (GMM) and the probabilistic neural network (PNN). The classification experiments were conducted using two data sets: SUSAS with three classes (high stress, moderate stress and neutral) and ORI with five classes (angry, happy, anxious, dysphoric and neutral). Depending on the features/classifier combination, the average classification results for the SUSAS data ranged from 95% to 61%, whereas the ORI data provided lower average rates ranging from 57% to 37%. The best overall performance was achieved while using the TEO-PWP in combination with the GMM classifier giving an average of 94.75% correct classifications for the SUSAS data and 56.6% for the ORI data. Different arousal levels between SUSAS and ORI emotional classes were suggested to be most likely cause for the difference in classification rates between these two data sets.
Keywords :
Gaussian processes; cepstral analysis; emotion recognition; feature extraction; medical signal processing; neural nets; signal classification; speech recognition; GMM classifier; Gaussian mixture model; ORI emotional class; TEO-PWP feature extraction; emotion recognition; family environment; mel frequency cepstral coefficient; probabilistic neural network; signal classification; speaker emotion; spontaneous speech; teager energy operator-perceptual wavelet packet; work environment; Emotion recognition; Helium; Hidden Markov models; Humans; Mel frequency cepstral coefficient; Natural languages; Speech analysis; Speech recognition; Stress; Wavelet packets;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedical Engineering , 2009. ICBBE 2009. 3rd International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-2901-1
Electronic_ISBN :
978-1-4244-2902-8
Type :
conf
DOI :
10.1109/ICBBE.2009.5162772
Filename :
5162772
Link To Document :
بازگشت