• DocumentCode
    454538
  • Title

    Augmented Statistical Models for Speech Recognition

  • Author

    Layton, M.I. ; Gales, M.J.F.

  • Author_Institution
    Eng. Dept., Cambridge Univ.
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    Recently there has been significant interest in developing new acoustic models for speech recognition. One such model, that allows complex dependencies to be represented, is the augmented statistical model. This incorporates additional dependencies by constructing a local exponential expansion of a standard HMM. Unfortunately, the resulting model often has an intractable normalisation term, rendering training difficult for all but binary classification tasks. In this paper, conditional augmented (C-Aug) models are proposed as an attractive alternative. Instead of modelling utterance likelihoods and inferring decision boundaries, C-Aug models directly model the posterior probability of class labels, conditioned on the utterance. The resulting model is easy to normalise and can be trained using conditional maximum likelihood estimation. In addition, as a convex model, the optimisation converges to a global maximum
  • Keywords
    maximum likelihood estimation; probability; speech recognition; augmented statistical models; conditional augmented models; conditional maximum likelihood estimation; posterior probability; speech recognition; Acoustical engineering; Hidden Markov models; Maximum likelihood estimation; Mutual information; Speech recognition; Statistical distributions; Statistics; Taylor series; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1659974
  • Filename
    1659974