DocumentCode :
3646703
Title :
Audio emotion recognition by perceptual features
Author :
Cenk Sezgin;Bilge Günsel;Canberk Hacioğlu
fYear :
2012
fDate :
4/1/2012 12:00:00 AM
Firstpage :
1
Lastpage :
4
Abstract :
A 9-D perceptual feature set has been used for audio emotion recognition. Performance tests have been performed on well known EMO-DB and VAM databases and the results are reported for different classifiers. Support Vector Machines, Gaussian Mixture Models and Learning Vector Quantization have been used in classification. Audio emotion recognition performance achieved by the perceptual visual features are compared to openEar and GerDa which are cited as state of the art audio emotion recognition systems. It is shown that the 9-D perceptual feature vectors are highly discriminative in continuous emotional space. It is concluded that the learning Vector Quantization increases the performance for natural records, while the Support Vector Machines provide the highest recognition rate for the acted records.
Keywords :
"Emotion recognition","Speech recognition","Conferences","Vector quantization","Harmonic analysis","Speech","Support vector machines"
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications Applications Conference (SIU), 2012 20th
Print_ISBN :
978-1-4673-0055-1
Type :
conf
DOI :
10.1109/SIU.2012.6204799
Filename :
6204799
Link To Document :
بازگشت