DocumentCode :
3422280
Title :
Boosted MMI for model and feature-space discriminative training
Author :
Povey, Daniel ; Kanevsky, Dimitri ; Kingsbury, Brian ; Ramabhadran, Bhuvana ; Saon, George ; Visweswariah, Karthik
Author_Institution :
TJ. Watson Res. Center, IBM, Yorktown Heights, NY
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4057
Lastpage :
4060
Abstract :
We present a modified form of the maximum mutual information (MMI) objective function which gives improved results for discriminative training. The modification consists of boosting the likelihoods of paths in the denominator lattice that have a higher phone error relative to the correct transcript, by using the same phone accuracy function that is used in Minimum Phone Error (MPE) training. We combine this with another improvement to our implementation of the Extended Baum-Welch update equations for MMI, namely the canceling of any shared part of the numerator and denominator statistics on each frame (a procedure that is already done in MPE). This change affects the Gaussian-specific learning rate. We also investigate another modification whereby we replace I-smoothing to the ML estimate with I-smoothing to the previous iteration´s value. Boosted MMI gives better results than MPE in both model and feature-space discriminative training, although not consistently.
Keywords :
Gaussian processes; feature extraction; speech recognition; Baum-Welch update equations; Gaussian-specific learning rate; denominator lattice; feature space discriminative training; maximum mutual information; minimum phone error; objective function; phone accuracy function; Boosting; Equations; Error correction; Gaussian processes; Hidden Markov models; Lattices; Maximum likelihood estimation; Mutual information; Speech recognition; Statistics; Discriminative Training; MMI; MPE; Maximum Margin; Speech Recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518545
Filename :
4518545
Link To Document :
بازگشت