• DocumentCode
    3165296
  • Title

    Discriminative training for speech recognition is compensating for statistical dependence in the HMM framework

  • Author

    Gillick, Dan ; Wegmann, Steven ; Gillick, Larry

  • Author_Institution
    Int. Comput. Sci. Inst., Berkeley, CA, USA
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4745
  • Lastpage
    4748
  • Abstract
    The parameters of the standard Hidden Markov Model framework for speech recognition are typically trained via Maximum Likelihood. However, better recognition performance is achievable with discriminative training criteria like Maximum Mutual Information or Minimum Phone Error. While it is generally accepted that these discriminative criteria are better suited to minimizing Word Error Rate, there is very little qualitative intuition for how the improvements are achieved. Through a series of “resampling” experiments, we show that discriminative training (MPE in particular) appears to be compensating for a specific incorrect assumption of the HMM-that speech frames are conditionally independent.
  • Keywords
    hidden Markov models; maximum likelihood estimation; speech recognition; statistical analysis; HMM framework; discriminative training; hidden Markov model framework; maximum likelihood; maximum mutual information; minimum phone error; speech recognition; statistical dependence compensation; word error rate minimization; Acoustics; Data models; Hidden Markov models; Maximum likelihood estimation; Speech; Speech recognition; Training; MMI; MPE; discriminative training; sampling; speech; statistical independence;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288979
  • Filename
    6288979