DocumentCode :
1111508
Title :
Estimation of probabilities from sparse data for the language model component of a speech recognizer
Author :
Katz, Slava M.
Author_Institution :
IBM T. J. Watson Research Center, Yorktown Heights, N.Y.
Volume :
35
Issue :
3
fYear :
1987
fDate :
3/1/1987 12:00:00 AM
Firstpage :
400
Lastpage :
401
Abstract :
The description of a novel type of m-gram language model is given. The model offers, via a nonlinear recursive procedure, a computation and space efficient solution to the problem of estimating probabilities from sparse data. This solution compares favorably to other proposed methods. While the method has been developed for and successfully implemented in the IBM Real Time Speech Recognizers, its generality makes it applicable in other areas where the problem of estimating probabilities from sparse data arises.
Keywords :
Acoustic signal processing; Maximum likelihood estimation; Natural languages; Probability; Recursive estimation; Speech processing; Speech recognition; Statistics;
fLanguage :
English
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
0096-3518
Type :
jour
DOI :
10.1109/TASSP.1987.1165125
Filename :
1165125
Link To Document :
بازگشت