Title :
A new framed-based MCE training strategy for Mandarin LVCSR
Author :
Feng, Junlan ; Du, Limin
Author_Institution :
Inst. of Acoust., Acad. Sinica, Beijing, China
Abstract :
Minimum classification error training (MCE), as a method of discriminative training, has proved successful to reduce error rates in HMM-based speech recognition systems. Several drawbacks of the classical MCE training method, such as high computation complexity, small amount of contributing speech segments, and the difficulty to determine the learning rate, have been improved by a new training framework proposed in this paper. The new framework includes (i) predefining the word lattice for each utterance to limit the search space; (ii) selecting frame-based N-top active models as competing model sets of the correct model; (iii) defining different learning rates for different model parameters experimentally. We give experimental results on the 863 Mandarin speech corpus. Our method gives a 2.0% increase of the word-level accuracy rate of syllable loop-back search compared to MLE trained models by two iterations
Keywords :
computational complexity; error statistics; hidden Markov models; search problems; speech recognition; HMM-based speech recognition; Mandarin LVCSR; Mandarin speech Corpus; computation complexity; contributing speech segments; discriminative training; error rate; frame-based N-top active models; framed-based MCE training strategy; large vocabulary continuous speech recognition; learning rate; minimum classification error training; search space; syllable loop-back search; training framework; word lattice; word-level accuracy rate; Acoustics; Cost function; Equations; Error analysis; Hidden Markov models; Lattices; Maximum likelihood estimation; Speech analysis; Speech recognition; Training data;
Conference_Titel :
Signal Processing Proceedings, 2000. WCCC-ICSP 2000. 5th International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-5747-7
DOI :
10.1109/ICOSP.2000.891635