DocumentCode :
3782048
Title :
Discriminative mixture weight estimation for large Gaussian mixture models
Author :
F. Beaufays;M. Weintraub; Yochai Konig
Author_Institution :
Speech Technol. & Res. Lab., SRI Int., Menlo Park, CA, USA
Volume :
1
fYear :
1999
Firstpage :
337
Abstract :
This paper describes a new approach to acoustic modeling for large vocabulary continuous speech recognition (LVCSR) systems. Each phone is modeled with a large Gaussian mixture model (GMM) whose context-dependent mixture weights are estimated with a sentence-level discriminative training criterion. The estimation problem is cast in a neural network framework, which enables the incorporation of the appropriate constraints on the mixture weight vectors, and allows a straight-forward training procedure, based on steepest descent. Experiments conducted on the Callhome-English and Switchboard databases show a significant improvement of the acoustic model performance, and a somewhat lesser improvement with the combined acoustic and language models.
Keywords :
"Context modeling","Power system modeling","Error analysis","Speech recognition","Databases","Decision trees","Laboratories","Vocabulary","Neural networks","Acoustic noise"
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-5041-3
Type :
conf
DOI :
10.1109/ICASSP.1999.758131
Filename :
758131
Link To Document :
بازگشت