Title :
Average redundancy rate of the Lempel-Ziv code
Author :
Louchard, Guy ; Szpankowski, Wojciech
Author_Institution :
Dept. d´´Inf., Univ. Libre de Bruxelles, Belgium
Abstract :
It was conjectured that the average redundancy rate, rn, for the Lempel-Ziv code (1978) is Θ(loglog n/log n) where n is the length of the database sequence. However, it was also known that for infinitely many n the redundancy rn is bounded from the below by 2/log n. We settle the above conjecture in the negative by proving that for a memoryless source the average redundancy rate attains asymptotically Ern=(A+δ(n))/log n+O(loglog n/log2n) where A is an explicitly given constant that depends on the source characteristics, and δ(x) is a fluctuating function. This result is a consequence of the established second-order properties for the number of phrases in the Lempel-Ziv algorithm. We also derive the leading term for the kth moment of the number of phrases. Finally, we discuss generalized Lempel-Ziv codes for which the average redundancy rates are computed and compared with the original Lempel-Ziv code
Keywords :
channel capacity; codes; data compression; encoding; memoryless systems; redundancy; sequences; Lempel-Ziv algorithm; Lempel-Ziv code; average redundancy rate; database sequence; fluctuating function; generalized Lempel-Ziv codes; leading term; memoryless source; second-order properties; source characteristics; Computer science; Convergence; Data compression; Databases; Dictionaries; Equations; Information resources; Modems; Noise measurement; Partitioning algorithms;
Conference_Titel :
Data Compression Conference, 1996. DCC '96. Proceedings
Conference_Location :
Snowbird, UT
Print_ISBN :
0-8186-7358-3
DOI :
10.1109/DCC.1996.488314