• DocumentCode
    3528072
  • Title

    Variational Bayesian Joint factor analysis for speaker verification

  • Author

    Zhao, Xianyu ; Dong, Yuan ; Zhao, Jian ; Lu, Liang ; Liu, Jiqing ; Wang, Haila

  • Author_Institution
    France Telecom R&D Center, Beijing
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4049
  • Lastpage
    4052
  • Abstract
    Joint factor analysis (JFA) has been successfully applied to speaker verification tasks to tackle speaker and session variability. In the sense of Bayesian statistics, it is beneficial to take account of the uncertainties in JFA to better characterize its speaker enrollment and verification processes, e.g. representing target speaker model by posteriori distribution of latent speaker factors and evaluating model likelihood by integrating over all latent factors. However, in a JFA model which has a large number of latent factors, it is computationally demanding to carry out these things in their exact form. In this paper, an alternative approach based on variational Bayesian is developed to explore uncertainties in JFA in an approximate yet efficient way. In this method, fully correlated posteriori distribution is approximated by a variational distribution of factorial form to facilitate inference; and a tight lower bound on model likelihood is derived. Experimental results on the 10sec4w-10sec4w task of the 2006 NIST SRE show that variational Bayesian JFA could obtain better performance than JFA using point estimate.
  • Keywords
    belief networks; maximum likelihood estimation; speaker recognition; statistical distributions; Bayesian statistics; JFA; fully correlated posteriori distribution; joint factor analysis; model likelihood; speaker verification; Bayesian methods; Computational complexity; Independent component analysis; Loudspeakers; Maximum likelihood estimation; NIST; Research and development; Speaker recognition; Telecommunications; Uncertainty; Bayesian statistics; Gaussian mixture model; joint factor analysis; speaker verification; variational approximation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960517
  • Filename
    4960517