DocumentCode :
714182
Title :
Automatic emotion recognition using auditory and prosodic indicative features
Author :
Gharsellaoui, Soumaya ; Selouani, Sid-Ahmed ; Dahmane, Adel Omar
Author_Institution :
Univ. du Quebec a Trois-Rivieres, Trois-Rivieres, QC, Canada
fYear :
2015
fDate :
3-6 May 2015
Firstpage :
1265
Lastpage :
1270
Abstract :
In this paper, a new framework for the automatic recognition of human emotions from speech was proposed. Besides auditory indicative features, selected prosodic and voice quality parameters were optimally combined with Mel frequency coefficients to perform an automatic emotion classification. For this purpose, the Emotion Prosody Speech and Transcript database, a certified speech corpus, was used throughout this study. An extensive set of experiments have been carried out in order to assess the effectiveness of this original mixture of prosodic, perceptual and auditory features to perform the emotion recognition task. These features were selected by using Linear Discriminant Analysis (LDA) and Principal Component Analysis (PCA) on the basis of their ability of discrimination. The selected features were used by the front-end processing stage of a hybrid Gaussian Mixture Model and Support Vector Machines (GSVMs) to perform the emotion classification. The results showed the effectiveness of the proposed feature extraction framework to discriminate between different human emotions when the LDA-PCA-GSVM classifier was used.
Keywords :
Gaussian processes; emotion recognition; feature extraction; mixture models; pattern classification; principal component analysis; set theory; speech recognition; support vector machines; LDA-PCA-GSVM classifier; auditory indicative features; automatic emotion classification; automatic emotion recognition; automatic human emotion recognition; certified speech corpus; emotion prosody speech and transcript database; feature extraction framework; front-end processing stage; hybrid Gaussian mixture model; linear discriminant analysis; mel frequency coefficients; principal component analysis; prosodic indicative features; speech recognition; support vector machines; voice quality parameters; Band-pass filters; Ear; Emotion recognition; Feature extraction; Principal component analysis; Speech; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical and Computer Engineering (CCECE), 2015 IEEE 28th Canadian Conference on
Conference_Location :
Halifax, NS
ISSN :
0840-7789
Print_ISBN :
978-1-4799-5827-6
Type :
conf
DOI :
10.1109/CCECE.2015.7129460
Filename :
7129460
Link To Document :
بازگشت