مرکز منطقه ای اطلاع رساني علوم و فناوري - Audio emotion recognition by perceptual features

DocumentCode :

3646703

Title :

Audio emotion recognition by perceptual features

Author :

Cenk Sezgin;Bilge Günsel;Canberk Hacioğlu

fYear :

2012

fDate :

4/1/2012 12:00:00 AM

Firstpage :

Lastpage :

Abstract :

A 9-D perceptual feature set has been used for audio emotion recognition. Performance tests have been performed on well known EMO-DB and VAM databases and the results are reported for different classifiers. Support Vector Machines, Gaussian Mixture Models and Learning Vector Quantization have been used in classification. Audio emotion recognition performance achieved by the perceptual visual features are compared to openEar and GerDa which are cited as state of the art audio emotion recognition systems. It is shown that the 9-D perceptual feature vectors are highly discriminative in continuous emotional space. It is concluded that the learning Vector Quantization increases the performance for natural records, while the Support Vector Machines provide the highest recognition rate for the acted records.

Keywords :

"Emotion recognition","Speech recognition","Conferences","Vector quantization","Harmonic analysis","Speech","Support vector machines"

Publisher :

ieee

Conference_Titel :

Signal Processing and Communications Applications Conference (SIU), 2012 20th

Print_ISBN :

978-1-4673-0055-1

Type :

conf

DOI :

10.1109/SIU.2012.6204799

Filename :

6204799

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3646703