DocumentCode :
1899068
Title :
Learning emotional speech by using Dirichlet Process Mixtures
Author :
Ülker, Yener ; Günsel, Bilge ; Sezgin, Cenk
Author_Institution :
Cogulortam Isaret Isleme ve Oruntu Tanima Grubu, Elektron. ve Haberlesme Muh. Bolumu, Istanbul Teknik Univ., Maslak, Turkey
fYear :
2011
fDate :
20-22 April 2011
Firstpage :
992
Lastpage :
995
Abstract :
Our aim in this paper is to illustrate the effectiveness of the Dirichlet Process Mixture (DPM) model for emotional speech class density estimation when the number of Gauss mixture components are unknown. The problem is modeled as a two-class classification problem where the classes are anger and-no-anger. Performance of the algorithm is evaluated on the features extracted from the emotion dataset EMO-DB, it is observed that the prior information inclusion led to increased non-anger recall rate. The introduced feature set performs perceptual analysis in time, spectral and Bark domains based on the Perceptual Evaluation of Audio Quality (PEAQ) model as described by the standard, ITU-R BS.1387-1 which provides a mathematical model resembling the human auditory system. Unlike the existing systems, the proposed feature set learns statistical characteristic of emotional differences hence enables us to represent the statistics of emotional audio with a small number of features.
Keywords :
Gaussian processes; emotion recognition; feature extraction; speech processing; Dirichlet process mixtures; EMO-DB emotion dataset; Gauss mixture components; ITU-R BS.1387-1; emotional audio statistics; emotional speech class density estimation; feature extraction; human auditory system; information inclusion; mathematical model; nonanger recall rate; perceptual evaluation-of-audio quality model; Conferences; Emotion recognition; Feature extraction; Monte Carlo methods; Speech; Speech processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications Applications (SIU), 2011 IEEE 19th Conference on
Conference_Location :
Antalya
Print_ISBN :
978-1-4577-0462-8
Electronic_ISBN :
978-1-4577-0461-1
Type :
conf
DOI :
10.1109/SIU.2011.5929820
Filename :
5929820
Link To Document :
بازگشت