DocumentCode
1111508
Title
Estimation of probabilities from sparse data for the language model component of a speech recognizer
Author
Katz, Slava M.
Author_Institution
IBM T. J. Watson Research Center, Yorktown Heights, N.Y.
Volume
35
Issue
3
fYear
1987
fDate
3/1/1987 12:00:00 AM
Firstpage
400
Lastpage
401
Abstract
The description of a novel type of m-gram language model is given. The model offers, via a nonlinear recursive procedure, a computation and space efficient solution to the problem of estimating probabilities from sparse data. This solution compares favorably to other proposed methods. While the method has been developed for and successfully implemented in the IBM Real Time Speech Recognizers, its generality makes it applicable in other areas where the problem of estimating probabilities from sparse data arises.
Keywords
Acoustic signal processing; Maximum likelihood estimation; Natural languages; Probability; Recursive estimation; Speech processing; Speech recognition; Statistics;
fLanguage
English
Journal_Title
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher
ieee
ISSN
0096-3518
Type
jour
DOI
10.1109/TASSP.1987.1165125
Filename
1165125
Link To Document