• DocumentCode
    561179
  • Title

    On Convergence of Discriminative Training Algorithm for Speaker Recognition

  • Author

    Madikeri, Srikanth R. ; Murthy, Hema A.

  • Author_Institution
    Comput. Sci. & Eng. Dept., Indian Inst. of Technol. Madras, Chennai, India
  • Volume
    1
  • fYear
    2011
  • fDate
    18-21 Dec. 2011
  • Firstpage
    189
  • Lastpage
    193
  • Abstract
    In this paper, we present a discriminative training algorithm for GMM based speaker verification, and develop a convergence proof for the same. During training of the models, instead of performing MAP adaptation of the UBM, the model parameters of the GMM are estimated such that target scores are maximized while impostor scores are minimized. The focus of the algorithm is to estimate more accurately, the parameters of the elements of the mixture that are unique to a speaker. The algorithm uses an Expectation-Maximisation-like framework for estimation of parameters. It is shown that the algorithm converges with appropriate choice of a regularization parameter (α). α balances target and impostor data during each iteration of the algorithm. Fast convergence can be obtained by modifying the regularization parameter after each iteration. It is shown that incrementing α is a sufficient condition for convergence. Further, an impostor data selection technique for fast implementation is addressed. The performance of the system is evaluated on NIST 2003 benchmark dataset. A relative improvement 9.5% is obtained using Mel Frequency Slope as feature.
  • Keywords
    Gaussian processes; expectation-maximisation algorithm; speaker recognition; GMM; Gaussian mixture model; convergence proof; discriminative training algorithm; expectation-maximisation-like framework; impostor data selection technique; mel frequency slope; regularization parameter; speaker recognition; speaker verification; Adaptation models; Convergence; Equations; Mathematical model; Maximum likelihood estimation; Speaker recognition; Training; discriminative training; fast convergence; speaker recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications and Workshops (ICMLA), 2011 10th International Conference on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    978-1-4577-2134-2
  • Type

    conf

  • DOI
    10.1109/ICMLA.2011.109
  • Filename
    6146967