Title :
Lossless Data Compression via Substring Enumeration
Author :
Dube, Danny ; Beaudoin, Vincent
Author_Institution :
Univ. Laval, Quebec City, QC, Canada
Abstract :
We present a technique that compresses a string w by enumerating all the substrings of w. The substrings are enumerated from the shortest to the longest and in lexicographic order. Compression is obtained from the fact that the set of the substrings of a particular length gives a lot of information about the substrings that are one bit longer. A linear-time, linear-space algorithm is presented. Experimental results show that the compression efficiency comes close to that of the best PPM variants. Other compression techniques are compared to ours.
Keywords :
algorithm theory; data compression; lexicographic order; linear space algorithm; linear time algorithm; lossless data compression; substring enumeration; Data compression;
Conference_Titel :
Data Compression Conference (DCC), 2010
Conference_Location :
Snowbird, UT
Print_ISBN :
978-1-4244-6425-8
Electronic_ISBN :
1068-0314
DOI :
10.1109/DCC.2010.28