• DocumentCode
    1865582
  • Title

    Discriminative Models for Speech Recognition

  • Author

    Gales, M.J.F.

  • Author_Institution
    Cambridge Univ., Cambridge
  • fYear
    2007
  • fDate
    Jan. 29 2007-Feb. 2 2007
  • Firstpage
    170
  • Lastpage
    176
  • Abstract
    The vast majority of automatic speech recognition systems use hidden Markov models (HMMs) as the underlying acoustic model. Initially these models were trained based on the maximum likelihood criterion. Significant performance gains have been obtained by using discriminative training criteria, such as maximum mutual information and minimum phone error. However, the underlying acoustic model is still generative, with the associated constraints on the state and transition probability distributions, and classification is based on Bayes´ decision rule. Recently, there has been interest in examining discriminative, or direct, models for speech recognition. This paper briefly reviews the forms of discriminative models that have been investigated. These include maximum entropy Markov models, hidden conditional random fields and conditional augmented models. The relationships between the various models and issues with applying them to large vocabulary continuous speech recognition will be discussed.
  • Keywords
    Bayes methods; Markov processes; maximum entropy methods; speech recognition; statistical distributions; Bayes decision rule; acoustic model; automatic speech recognition; conditional augmented models; discriminative models; hidden Markov models; hidden conditional random fields; maximum entropy Markov models; probability distributions; Acoustical engineering; Automatic speech recognition; Bayesian methods; Hidden Markov models; Mutual information; Performance gain; Probability distribution; Speech recognition; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Theory and Applications Workshop, 2007
  • Conference_Location
    La Jolla, CA
  • Print_ISBN
    978-0-615-15314-8
  • Type

    conf

  • DOI
    10.1109/ITA.2007.4357576
  • Filename
    4357576