DocumentCode :
2702093
Title :
Cross-Validation EM Training for Robust Parameter Estimation
Author :
Shinozaki, Tetsuo ; Ostendorf, Mari
Author_Institution :
Kyoto Univ., Japan
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
A new maximum likelihood training algorithm is proposed that compensates for weaknesses of the EM algorithm by using cross-validation likelihood in the expectation step to avoid overtraining. By using a set of sufficient statistics associated with a partitioning of the training data, as in parallel EM, the algorithm has the same order of computational requirements as the original EM algorithm. Analyses using a GMM with artificial data show the proposed algorithm is more robust for overtraining than the conventional EM algorithm. Large vocabulary recognition experiments on Mandarin broadcast news data show that the method makes better use of more parameters and gives lower recognition error rates than EM training.
Keywords :
expectation-maximisation algorithm; learning (artificial intelligence); natural languages; speech recognition; GMM; Mandarin broadcast news data; cross-validation EM training; maximum likelihood training algorithm; recognition error rates; robust parameter estimation; statistics; vocabulary recognition; Algorithm design and analysis; Broadcasting; Concurrent computing; Maximum likelihood estimation; Parameter estimation; Partitioning algorithms; Robustness; Statistics; Training data; Vocabulary; EM training; cross-validation; overtraining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366943
Filename :
4218131
Link To Document :
بازگشت