Title :
Gaussian Mixture Language Models for Speech Recognition
Author :
Afify, M. ; Siohan, Olivier ; Sarikaya, R.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
We propose a Gaussian mixture language model for speech recognition. Two potential benefits of using this model are smoothing unseen events, and ease of adaptation. It is shown how this model can be used alone or in conjunction with a a conventional N-gram model to calculate word probabilities. An interesting feature of the proposed technique is that many methods developed for acoustic models can be easily ported to GMLM. We developed two implementations of the proposed model for large vocabulary Arabic speech recognition with results comparable to conventional N-gram.
Keywords :
Gaussian processes; natural language processing; speech recognition; Gaussian mixture language models; acoustic models; speech recognition; vocabulary Arabic speech recognition; History; Large-scale systems; Maximum likelihood linear regression; Natural languages; Neural networks; Probability; Smoothing methods; Space technology; Speech recognition; Vocabulary; Gaussian mixture model; Language model; N-gram; continuous space;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367155