Title :
A Preliminary Study on GMM Weight Transformation for Emotional Speaker Recognition
Author :
Li Chen ; Yingchun Yang
Author_Institution :
Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
Abstract :
The performance of speaker recognition system degrades when the emotional states are inconsistent during the enrollment and evaluation stage. Emotional GMM model synthesis, such as NEGT (Neutral-Emotional GMM mean Transformation), is one way to reduce this degradation. This paper discovers that GMM weight transformation is also feasible and the number of parameters that need to be modified is much less than that of GMM mean ransformation. Thus, we propose two algorithms: RBFNN (Radial Basis Function Neural Network) and EBSR (Exemplar Based Sparse Representation) based GMM weight transformation to model the neutral-to-emotion weight transformation law for emotional GMM model synthesis. The experiments carried on MASC show that IR has been increased by 6.91% and 5.74% through these two algorithms respectively, compared with that of the GMM-UBM system. Meanwhile, these two algorithms require less development data and time compared with those of NEGT.
Keywords :
Gaussian processes; radial basis function networks; speaker recognition; EBSR; GMM weight transformation; GMM-UBM system; MASC; NEGT; RBFNN; emotional GMM model synthesis; emotional speaker recognition system; emotional states; enrollment stage; evaluation stage; exemplar based sparse representation; neutral-to-emotion weight transformation law; radial basis function neural network; Acoustics; Adaptation models; Robustness; Signal processing algorithms; Speaker recognition; Speech; Vectors; Emotional Speaker Recognition; Neural Network; Sparse Representation;
Conference_Titel :
Affective Computing and Intelligent Interaction (ACII), 2013 Humaine Association Conference on
Conference_Location :
Geneva
DOI :
10.1109/ACII.2013.12