• DocumentCode
    3484579
  • Title

    Discriminative splitting of Gaussian/log-linear mixture HMMs for speech recognition

  • Author

    Tahir, Muhammad Ali ; Schlüter, Ralf ; Ney, Hermann

  • Author_Institution
    Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
  • fYear
    2011
  • fDate
    11-15 Dec. 2011
  • Firstpage
    7
  • Lastpage
    11
  • Abstract
    This paper presents a method to incorporate mixture density splitting into the acoustic model discriminative log-linear training. The standard method is to obtain a high resolution model by maximum likelihood training and density splitting, and then further training this model discriminatively. For a single Gaussian density per state the log-linear MMI optimization is a global maximum problem, and by further splitting and discriminative training of this model we can get a higher complexity model. The mixture training is not a global maximum problem, nevertheless experimentally we achieve large gains in the objective function and corresponding moderate gains in the word error rate on a large vocabulary corpus.
  • Keywords
    Gaussian processes; hidden Markov models; maximum likelihood estimation; speech recognition; Gaussian-log-linear mixture HMM; acoustic model discriminative log-linear training; complexity model; density splitting; discriminative splitting; high resolution model; log-linear MMI optimization; maximum likelihood training; mixture density splitting; speech recognition; vocabulary corpus; Hidden Markov models; Mel frequency cepstral coefficient; Optimization; Speech recognition; Training; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
  • Conference_Location
    Waikoloa, HI
  • Print_ISBN
    978-1-4673-0365-1
  • Electronic_ISBN
    978-1-4673-0366-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2011.6163896
  • Filename
    6163896