Title :
On Probability Estimation via Relative Frequencies and Discount
Author :
Mattern, Christopher
Author_Institution :
Tech. Univ. Ilmenau, Ilmenau, Germany
Abstract :
Probability estimation is an elementary building block of every statistical data compression algorithm. In practice probability estimation is often based on relative letter frequencies which get scaled down, when their sum is too large. Such algorithms are attractive in terms of memory requirements, running time and practical performance. However, there still is a lack of theoretical understanding. In this work we formulate a typical probability estimation algorithm based on relative frequencies and frequency discount, Algorithm RFD. Our main contribution is its theoretical analysis. We show that Algorithm RFD performs almost as good as any piecewise stationary model with either bounded or unbounded letter probabilities. This theoretically confirms the recency effect of periodic frequency discount, which has often been observed empirically.
Keywords :
data compression; estimation theory; probability; statistical analysis; Algorithm RFD; periodic frequency discount; piecewise stationary model; probability estimation algorithm; relative letter frequencies; statistical data compression algorithm; unbounded letter probabilities; Algorithm design and analysis; Encoding; Partitioning algorithms; Prediction algorithms; Probability distribution; Redundancy;
Conference_Titel :
Data Compression Conference (DCC), 2015
Conference_Location :
Snowbird, UT
DOI :
10.1109/DCC.2015.27