مرکز منطقه ای اطلاع رساني علوم و فناوري - Compression and predictive distributions for large alphabet i.i.d and Markov models

DocumentCode :

1780401

Title :

Compression and predictive distributions for large alphabet i.i.d and Markov models

Author :

Xiao Yang ; Barron, Andrew R.

Author_Institution :

Dept. of Stat., Yale Univ., New Haven, CT, USA

fYear :

2014

fDate :

June 29 2014-July 4 2014

Firstpage :

2504

Lastpage :

2508

Abstract :

This paper considers coding and predicting sequences of random variables generated from a large alphabet. We start from the i.i.d model and propose a simple coding distribution formulated by a product of tilted Poisson distributions which achieves close to optimal performance. Then we extend to Markov models, and in particular, tree sources. A context tree based algorithm is designed according to the frequency of various contexts in the data. It is a greedy algorithm which seeks for the greatest savings in codelength when constructing the tree. Compression and prediction of individual counts associated with the contexts again uses a product of tilted Poisson distributions. Implementing this method on a Chinese novel, about 20.56% savings in codelength is achieved compared to the i.i.d model.

Keywords :

Markov processes; Poisson distribution; codes; greedy algorithms; trees (mathematics); Markov models; codelength; coding distribution; context tree based algorithm; greedy algorithm; large alphabet data compression; tilted Poisson distributions; Context; Encoding; Markov processes; Predictive models; Random variables; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Information Theory (ISIT), 2014 IEEE International Symposium on

Conference_Location :

Honolulu, HI

Type :

conf

DOI :

10.1109/ISIT.2014.6875285

Filename :

6875285

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1780401