Title :
On Turing´s formula for word probabilities
Author_Institution :
IBM T. J. Watson Research Center, Yorktown Heights, NY, USA
fDate :
12/1/1985 12:00:00 AM
Abstract :
A. M. Turing, in a 1941 personal communication to I. J. Good, suggested a formula for estimating probabilities of words in text and, more generally, of species in a mixed population of various species. It is remarkable that Turing´s formula can be obtained by significantly different statistical methods; we compare three ways to obtain it.
Keywords :
Bayesian methods; Distributed computing; Frequency estimation; Natural languages; Probability; Smoothing methods; Speech recognition; Statistical analysis; Statistical distributions; Vocabulary;
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on
DOI :
10.1109/TASSP.1985.1164728