DocumentCode :
2053080
Title :
Multi-class UBM-based MLLR m-vector system for speaker verification
Author :
Sarkar, A.K. ; Barras, Claude
Author_Institution :
LIMSI, Univ. Paris-Sud, Orsay, France
fYear :
2013
fDate :
9-13 Sept. 2013
Firstpage :
1
Lastpage :
5
Abstract :
In this paper, we extend the recently introduced Maximum Likelihood Linear Regression (MLLR) super-vector based m-vector speaker verification system to a multi-class MLLR m-vector system. In the conventional case, global class MLLR transformation is estimated with respect to Universal Background Model (UBM) for a given speech data, which is then used in the form of super-vector for m-vector system. In the proposed system, Gaussian mean vectors of the UBM are first clustered into several classes. Then, MLLR transformations are estimated (of a speech data) for each class, and are used in the form of super-vectors for speaker characterization using the m-vector technique. We consider two clustering approaches: one is based on the conventional K-means and the other is proposed based on Expectation Maximization (EM) and Maximum Likelihood (ML). Both systems yield better performance than the conventional m-vector system and allow for multiple MLLR transforms without additional temporal alignment of the data with respect to UBM. Furthermore, we show that, contrary to conventional K-means, the proposed clustering is not affected by the random initialization, and also provides equal or comparable system performance. The system performances are shown on NIST 2008 SRE core condition over various tasks.
Keywords :
Gaussian processes; expectation-maximisation algorithm; regression analysis; speaker recognition; speech processing; Gaussian mean vectors; MLLR transformation; conventional K-means; expectation maximization approach; m-vector technique; maximum likelihood approach; maximum likelihood linear regression; multi-class MLLR m-vector system; multi-class UBM-based MLLR; speaker characterization; speech data; super-vector based m-vector speaker verification system; universal background model; Clustering algorithms; Estimation; Hidden Markov models; NIST; Speech; System performance; Vectors; MLLR super-vector; Multi-class m-vector; Speaker verification; Statistical clustering algorithm; UBM;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2013 Proceedings of the 21st European
Conference_Location :
Marrakech
Type :
conf
Filename :
6811429
Link To Document :
بازگشت