مرکز منطقه ای اطلاع رساني علوم و فناوري - Emotion Recognition in Spontaneous Speech within Work and Family Environments

DocumentCode :

2507021

Title :

Emotion Recognition in Spontaneous Speech within Work and Family Environments

Author :

Ling He ; Lech, Margaret ; Maddage, N. ; Memon, Saud A. ; Allen, Nathaniel

Author_Institution :

Sch. of Electr. & Comput. Eng., RMIT Univ., Melbourne, VIC, Australia

fYear :

2009

fDate :

11-13 June 2009

Firstpage :

Lastpage :

Abstract :

The speech signal is an important tool for conveying information between humans; at the same time, it is an indicator of a speaker´s emotions. In this paper, the automatic identification of affect from speech containing spontaneously expressed (not acted) emotions within different environments was investigated. The teager energy operator-perceptual wavelet packet (TEO-PWP) features as well as the mel frequency cepstral coefficients (MFCC) were used to model the emotions using two classifiers: the Gaussian mixture model (GMM) and the probabilistic neural network (PNN). The classification experiments were conducted using two data sets: SUSAS with three classes (high stress, moderate stress and neutral) and ORI with five classes (angry, happy, anxious, dysphoric and neutral). Depending on the features/classifier combination, the average classification results for the SUSAS data ranged from 95% to 61%, whereas the ORI data provided lower average rates ranging from 57% to 37%. The best overall performance was achieved while using the TEO-PWP in combination with the GMM classifier giving an average of 94.75% correct classifications for the SUSAS data and 56.6% for the ORI data. Different arousal levels between SUSAS and ORI emotional classes were suggested to be most likely cause for the difference in classification rates between these two data sets.

Keywords :

Gaussian processes; cepstral analysis; emotion recognition; feature extraction; medical signal processing; neural nets; signal classification; speech recognition; GMM classifier; Gaussian mixture model; ORI emotional class; TEO-PWP feature extraction; emotion recognition; family environment; mel frequency cepstral coefficient; probabilistic neural network; signal classification; speaker emotion; spontaneous speech; teager energy operator-perceptual wavelet packet; work environment; Emotion recognition; Helium; Hidden Markov models; Humans; Mel frequency cepstral coefficient; Natural languages; Speech analysis; Speech recognition; Stress; Wavelet packets;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Bioinformatics and Biomedical Engineering , 2009. ICBBE 2009. 3rd International Conference on

Conference_Location :

Beijing

Print_ISBN :

978-1-4244-2901-1

Electronic_ISBN :

978-1-4244-2902-8

Type :

conf

DOI :

10.1109/ICBBE.2009.5162772

Filename :

5162772

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2507021