Title :
Generalized Lempel-Ziv parsing scheme and its preliminary analysis of the average profile
Author :
Louchard, Guy ; Szpankowski, Wojciech
Author_Institution :
Lab. d'Inf. Theorique, Univ. Libre de Bruxelles, Belgium
Abstract :
The goal of this contribution is twofold: (i) to introduce a generalized Lempel-Ziv parsing scheme, and (ii) to analyze second-order properties of some compression schemes based on this parsing scheme. We consider a generalized Lempel-Ziv parsing scheme that partitions a sequence of length n into variable-length phrases (blocks) such that a new block is the longest substring seen in the past by at most b-1 phrases. The case b=1 corresponds to the original Lempel-Ziv scheme. In this paper, we investigate the size of a randomly selected phrase and the average number of phrases of a given size by analyzing the so-called b-digital search tree (b-DST) representation. For a memoryless source, we prove that the size of a typical phrase is asymptotically normally distributed. This result is new even for b=1, and the case b>1 is a non-trivial extension.
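The parsing rule described in the abstract can be illustrated with a minimal Python sketch. It rests on one reading of the abstract, which is an assumption rather than the authors' stated algorithm: each new phrase is taken to be the shortest prefix of the remaining input that has occurred as a complete phrase at most b-1 times so far, which reduces to the familiar incremental (LZ78-style) parsing when b=1. The names `generalized_lz_parse` and `phrase_length_profile` are hypothetical, and a plain counter stands in for the b-DST.

```python
from collections import Counter


def generalized_lz_parse(sequence, b=1):
    """Split `sequence` into phrases under the assumed generalized rule.

    Assumption: a phrase ends at the shortest prefix of the remaining
    input that has been emitted as a phrase fewer than b times.
    """
    if b < 1:
        raise ValueError("b must be at least 1")
    counts = Counter()  # how many times each string was emitted as a phrase
    phrases = []
    start, n = 0, len(sequence)
    while start < n:
        end = start + 1
        # Extend the candidate while it has already been used b times.
        while end < n and counts[sequence[start:end]] >= b:
            end += 1
        phrase = sequence[start:end]  # the final phrase may be a repeat
        counts[phrase] += 1
        phrases.append(phrase)
        start = end
    return phrases


def phrase_length_profile(phrases):
    """Count how many phrases have each length (an empirical profile)."""
    return Counter(len(p) for p in phrases)


if __name__ == "__main__":
    text = "1100101000100010011"
    for b in (1, 2):
        parsed = generalized_lz_parse(text, b)
        print(b, parsed, dict(phrase_length_profile(parsed)))
```

In tree terms, the counter plays the role of a b-DST in which each node can hold up to b phrases: a phrase is placed at the first node on its path that still has room. Increasing b therefore tends to produce more, shorter phrases for the same input.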
Keywords :
data compression; grammars; average profile; b-digital search tree; compression schemes; generalized Lempel-Ziv parsing scheme; memoryless source; preliminary analysis; Computer science; Data compression; Entropy; Frequency; Heart; Information services; Internet; Partitioning algorithms; Tree data structures; Web sites
Conference_Title :
Data Compression Conference, 1995. DCC '95. Proceedings
Conference_Location :
Snowbird, UT
Print_ISBN :
0-8186-7012-6
DOI :
10.1109/DCC.1995.515516