DocumentCode :
3165873
Title :
Discriminative training of weighted polynomial vector for acoustic language recognition
Author :
Zhang, Ce ; Zheng, Rong ; Xu, Bo
Author_Institution :
Digital Content Technol. Res. Center, Inst. of Autom., Beijing, China
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4849
Lastpage :
4852
Abstract :
In this paper, we propose a discriminative method for the acoustic feature based language recognizer, which is a modification of the polynomial expansion in generalized linear discriminant sequence (GLDS) kernel. It is inspired by the Gaussian mixture model-support vector machine (GMM-SVM) system which has been successfully used in both speaker and language recognition. Because of the restriction of calculations in our method, it is nearly impossible to stack component dependent polynomial expansion vectors as GM-MSVM system does. Thus we introduce a set of language dependent weights to fuse these expansion vectors and utilize maximum mutual information (MMI) criterion and logistic regression to estimate the model parameters. Finally, we evaluate our method on the close-set, 30 seconds test condition of NIST LRE 2007 and up to 30% relative improvement can be achieved comparing to the baseline GLDS system.
Keywords :
Gaussian processes; regression analysis; speaker recognition; support vector machines; GM-MSVM system; Gaussian mixture model-support vector machine; MMI criterion; NIST LRE 2007; baseline GLDS system; component dependent polynomial expansion vectors; discriminative training; feature based language recognizer; generalized linear discriminant sequence kernel; language dependent weights; logistic regression; maximum mutual information criterion; model parameter estimation; speaker recognition; weighted polynomial vector; Acoustics; Kernel; Logistics; Polynomials; Support vector machines; Training; Vectors; GMM; Language recognition; maximum mutual information; multi-class logistic regression; weighted GLDS;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6289005
Filename :
6289005
Link To Document :
بازگشت