• DocumentCode
    2769458
  • Title

    Bayesian adaptation in HMM training and decoding using a mixture of feature transforms

  • Author

    Tsakalidis, Stavros ; Matsoukas, Spyros

  • Author_Institution
    BBN Technol., Cambridge
  • fYear
    2007
  • fDate
    9-13 Dec. 2007
  • Firstpage
    329
  • Lastpage
    334
  • Abstract
    Adaptive training under a Bayesian framework addresses some limitations of the standard maximum likelihood approaches. Also, the adaptively trained system can be directly used in unsupervised inference. The Bayesian framework uses a distribution of the transform rather than a point estimate. A continuous transform distribution makes the integral associated with the Bayesian framework intractable and therefore various approximations have been proposed. In this paper we model the transform distribution via a mixture of transforms. Under this model, the likelihood of an utterance is computed as a weighted sum of the likelihoods obtained by transforming its features based on each of the transforms in the mixture, with weights set to the transform priors. Experimental results on Arabic broadcast news exhibit increased likelihood on acoustic training data and improved speech recognition performance on unseen test data, compared to speaker independent and standard adaptive models.
  • Keywords
    Bayes methods; adaptive decoding; hidden Markov models; inference mechanisms; maximum likelihood decoding; speaker recognition; transforms; unsupervised learning; Bayesian speaker adaptive training; continuous transform distribution; decoding; feature transforms; hidden Markov model; maximum likelihood approach; unsupervised inference; Acoustic testing; Bayesian methods; Broadcasting; Hidden Markov models; Loudspeakers; Maximum likelihood decoding; Maximum likelihood estimation; Speech recognition; Stochastic processes; Training data; Bayesian inference; adaptive training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
  • Conference_Location
    Kyoto
  • Print_ISBN
    978-1-4244-1746-9
  • Electronic_ISBN
    978-1-4244-1746-9
  • Type

    conf

  • DOI
    10.1109/ASRU.2007.4430133
  • Filename
    4430133