DocumentCode
705232
Title
A GMM-supervector approach to language recognition with adaptive relevance factor
Author
Chang Huai You ; Haizhou Li ; Kong Aik Lee
Author_Institution
Human Language Technol. Dept., A*STAR, Singapore, Singapore
fYear
2010
fDate
23-27 Aug. 2010
Firstpage
1993
Lastpage
1997
Abstract
Gaussian mixture model (GMM) supervector has been proven effective for language recognition. While a speech utterance can be represented with a GMM which can be obtained through maximum a posteriori (MAP) criterion, it is observed that the supervector formed from the GMM encounters shifting problem in the supervector space due to varying duration of the utterance. We propose an adaptive relevance factor for the MAP estimation to mitigate the negative effect of the variability of individual utterances. Moreover, we develop a language recognition system with a Bhattacharyya-based kernel where the information from the mean vectors and covariance matrices are separately assigned into corresponding dissimilarities. We show the effectiveness of the proposed adaptive relevance factor and the Bhattacharyya-based kernel on the National Institute of Standards and Technology (NIST) language recognition evaluation (LRE) 2009 task.
Keywords
Gaussian processes; covariance matrices; maximum likelihood estimation; mixture models; natural language processing; speech recognition; vectors; Bhattacharyya-based kernel; GMM-supervector approach; Gaussian mixture model; LRE 2009 task; MAP criterion; MAP estimation; NIST; National Institute of Standards and Technology; adaptive relevance factor; covariance matrices; language recognition evaluation; language recognition system; maximum a posteriori criterion; mean vectors; speech utterance; supervector space; Databases; Estimation; Kernel; NIST; Speech; Support vector machines; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2010 18th European
Conference_Location
Aalborg
ISSN
2219-5491
Type
conf
Filename
7096505
Link To Document