• DocumentCode
    2179063
  • Title

    Discriminative training for Bayesian sensing hidden Markov models

  • Author

    Saon, George ; Chien, Jen-Tzung

  • Author_Institution
    IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5316
  • Lastpage
    5319
  • Abstract
    We describe feature space and model space discriminative training for a new class of acoustic models called Bayesian sensing hidden Markov models (BS-HMMs). In BS-HMMs, speech data is represented by a set of state-dependent basis vectors. The relevance of a feature vector to different bases is determined by the precision matrices of the sensing weights. The basis vectors and the precision matrices of the reconstruction errors are jointly estimated by optimizing a maximum mutual information (MMI) criterion. Additionally, we discuss the training of an fMPE-style discriminative feature transformation under the same criterion given these models. Experimental results on an LVCSR task show that the proposed models outperform discriminatively trained conventional HMMs with Gaussian mixture models (GMMs). Cross-adapting the baseline GMM-HMMs to the BS-HMM output yields a 6% relative gain which indicates that the two systems make different errors.
  • Keywords
    belief networks; hidden Markov models; matrix algebra; speech processing; BS-HMM; Bayesian sensing hidden Markov models; LVCSR; MMI criterion; discriminative training; fMPE-style discriminative feature transformation; matrices; maximum mutual information criterion; speech data; state-dependent basis vectors; Acoustics; Bayesian methods; Hidden Markov models; Sensors; Smoothing methods; Training; Transforms; Bayesian learning; basis representation; discriminative training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947558
  • Filename
    5947558