DocumentCode
3528072
Title
Variational Bayesian Joint factor analysis for speaker verification
Author
Zhao, Xianyu ; Dong, Yuan ; Zhao, Jian ; Lu, Liang ; Liu, Jiqing ; Wang, Haila
Author_Institution
France Telecom R&D Center, Beijing
fYear
2009
fDate
19-24 April 2009
Firstpage
4049
Lastpage
4052
Abstract
Joint factor analysis (JFA) has been successfully applied to speaker verification tasks to tackle speaker and session variability. In the sense of Bayesian statistics, it is beneficial to take account of the uncertainties in JFA to better characterize its speaker enrollment and verification processes, e.g. representing target speaker model by posteriori distribution of latent speaker factors and evaluating model likelihood by integrating over all latent factors. However, in a JFA model which has a large number of latent factors, it is computationally demanding to carry out these things in their exact form. In this paper, an alternative approach based on variational Bayesian is developed to explore uncertainties in JFA in an approximate yet efficient way. In this method, fully correlated posteriori distribution is approximated by a variational distribution of factorial form to facilitate inference; and a tight lower bound on model likelihood is derived. Experimental results on the 10sec4w-10sec4w task of the 2006 NIST SRE show that variational Bayesian JFA could obtain better performance than JFA using point estimate.
Keywords
belief networks; maximum likelihood estimation; speaker recognition; statistical distributions; Bayesian statistics; JFA; fully correlated posteriori distribution; joint factor analysis; model likelihood; speaker verification; Bayesian methods; Computational complexity; Independent component analysis; Loudspeakers; Maximum likelihood estimation; NIST; Research and development; Speaker recognition; Telecommunications; Uncertainty; Bayesian statistics; Gaussian mixture model; joint factor analysis; speaker verification; variational approximation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960517
Filename
4960517
Link To Document