• DocumentCode
    3162588
  • Title

    A general discriminative training algorithm for speech recognition using weighted finite-state transducers

  • Author

    Yong Zhao ; Ljolje, Andrej ; Caseiro, Diamantino ; Biing-Hwang Juang

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4217
  • Lastpage
    4220
  • Abstract
    In this paper, we present a general algorithmic framework based on WFSTs for implementing a variety of discriminative training methods, such as MMI, MCE, and MPE/MWE. In contrast to the ordinary word lattices, the transducer-based lattices are more amenable to representing and manipulating the underlying hypothesis space and have a finer granularity at the HMM-state level. The transducers are processed into a two-layer hierarchy: at a high level, it is analogous to a word lattice, and each word transition embodies an HMM-state subgraph for that word at a lower level. This hierarchy combined with the appropriate customization of the transducers leads to a flexible implementation for all of the training criteria being discussed. The effectiveness of the framework is verified on two speech recognition tasks: Resource Management, and AT&T SCANMail, an internal voicemail-to-text task.
  • Keywords
    hidden Markov models; speech recognition; transducers; HMM-state level; HMM-state subgraph; MCE; MMI; MPE-MWE; general discriminative training algorithm; hidden Markov models; speech recognition tasks; transducer-based lattices; two-layer hierarchy; weighted finite-state transducers; Accuracy; Context; Hidden Markov models; Lattices; Speech recognition; Training; Transducers; discriminative training; speech recognition; weighted finite-state transducer;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288849
  • Filename
    6288849