مرکز منطقه ای اطلاع رساني علوم و فناوري - Automatic emotion recognition using auditory and prosodic indicative features

DocumentCode :

714182

Title :

Automatic emotion recognition using auditory and prosodic indicative features

Author :

Gharsellaoui, Soumaya ; Selouani, Sid-Ahmed ; Dahmane, Adel Omar

Author_Institution :

Univ. du Quebec a Trois-Rivieres, Trois-Rivieres, QC, Canada

fYear :

2015

fDate :

3-6 May 2015

Firstpage :

1265

Lastpage :

1270

Abstract :

In this paper, a new framework for the automatic recognition of human emotions from speech was proposed. Besides auditory indicative features, selected prosodic and voice quality parameters were optimally combined with Mel frequency coefficients to perform an automatic emotion classification. For this purpose, the Emotion Prosody Speech and Transcript database, a certified speech corpus, was used throughout this study. An extensive set of experiments have been carried out in order to assess the effectiveness of this original mixture of prosodic, perceptual and auditory features to perform the emotion recognition task. These features were selected by using Linear Discriminant Analysis (LDA) and Principal Component Analysis (PCA) on the basis of their ability of discrimination. The selected features were used by the front-end processing stage of a hybrid Gaussian Mixture Model and Support Vector Machines (GSVMs) to perform the emotion classification. The results showed the effectiveness of the proposed feature extraction framework to discriminate between different human emotions when the LDA-PCA-GSVM classifier was used.

Keywords :

Gaussian processes; emotion recognition; feature extraction; mixture models; pattern classification; principal component analysis; set theory; speech recognition; support vector machines; LDA-PCA-GSVM classifier; auditory indicative features; automatic emotion classification; automatic emotion recognition; automatic human emotion recognition; certified speech corpus; emotion prosody speech and transcript database; feature extraction framework; front-end processing stage; hybrid Gaussian mixture model; linear discriminant analysis; mel frequency coefficients; principal component analysis; prosodic indicative features; speech recognition; support vector machines; voice quality parameters; Band-pass filters; Ear; Emotion recognition; Feature extraction; Principal component analysis; Speech; Support vector machines;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Electrical and Computer Engineering (CCECE), 2015 IEEE 28th Canadian Conference on

Conference_Location :

Halifax, NS

ISSN :

0840-7789

Print_ISBN :

978-1-4799-5827-6

Type :

conf

DOI :

10.1109/CCECE.2015.7129460

Filename :

7129460

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=714182