Title :
A general discriminative training algorithm for speech recognition using weighted finite-state transducers
Author :
Yong Zhao ; Ljolje, Andrej ; Caseiro, Diamantino ; Biing-Hwang Juang
Author_Institution :
Dept. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Abstract :
In this paper, we present a general algorithmic framework based on WFSTs for implementing a variety of discriminative training methods, such as MMI, MCE, and MPE/MWE. In contrast to the ordinary word lattices, the transducer-based lattices are more amenable to representing and manipulating the underlying hypothesis space and have a finer granularity at the HMM-state level. The transducers are processed into a two-layer hierarchy: at a high level, it is analogous to a word lattice, and each word transition embodies an HMM-state subgraph for that word at a lower level. This hierarchy combined with the appropriate customization of the transducers leads to a flexible implementation for all of the training criteria being discussed. The effectiveness of the framework is verified on two speech recognition tasks: Resource Management, and AT&T SCANMail, an internal voicemail-to-text task.
Keywords :
hidden Markov models; speech recognition; transducers; HMM-state level; HMM-state subgraph; MCE; MMI; MPE-MWE; general discriminative training algorithm; hidden Markov models; speech recognition tasks; transducer-based lattices; two-layer hierarchy; weighted finite-state transducers; Accuracy; Context; Hidden Markov models; Lattices; Speech recognition; Training; Transducers; discriminative training; speech recognition; weighted finite-state transducer;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288849