• DocumentCode
    1111508
  • Title

    Estimation of probabilities from sparse data for the language model component of a speech recognizer

  • Author

    Katz, Slava M.

  • Author_Institution
    IBM T. J. Watson Research Center, Yorktown Heights, N.Y.
  • Volume
    35
  • Issue
    3
  • fYear
    1987
  • fDate
    3/1/1987 12:00:00 AM
  • Firstpage
    400
  • Lastpage
    401
  • Abstract
    The description of a novel type of m-gram language model is given. The model offers, via a nonlinear recursive procedure, a computation and space efficient solution to the problem of estimating probabilities from sparse data. This solution compares favorably to other proposed methods. While the method has been developed for and successfully implemented in the IBM Real Time Speech Recognizers, its generality makes it applicable in other areas where the problem of estimating probabilities from sparse data arises.
  • Keywords
    Acoustic signal processing; Maximum likelihood estimation; Natural languages; Probability; Recursive estimation; Speech processing; Speech recognition; Statistics;
  • fLanguage
    English
  • Journal_Title
    Acoustics, Speech and Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0096-3518
  • Type

    jour

  • DOI
    10.1109/TASSP.1987.1165125
  • Filename
    1165125