Title :
Estimation of probabilities from sparse data for the language model component of a speech recognizer
Author_Institution :
IBM T. J. Watson Research Center, Yorktown Heights, N.Y.
fDate :
3/1/1987 12:00:00 AM
Abstract :
The description of a novel type of m-gram language model is given. The model offers, via a nonlinear recursive procedure, a computation and space efficient solution to the problem of estimating probabilities from sparse data. This solution compares favorably to other proposed methods. While the method has been developed for and successfully implemented in the IBM Real Time Speech Recognizers, its generality makes it applicable in other areas where the problem of estimating probabilities from sparse data arises.
Keywords :
Acoustic signal processing; Maximum likelihood estimation; Natural languages; Probability; Recursive estimation; Speech processing; Speech recognition; Statistics;
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on
DOI :
10.1109/TASSP.1987.1165125