Title :
Multiple-dictionary compression using partial matching
Author :
Hoang, Dzung T. ; Long, Philip M. ; Vitter, Jeffrey Scott
Author_Institution :
Dept. of Comput. Sci., Duke Univ., Durham, NC, USA
Abstract :
Motivated by the desire to find text compressors that compress better than existing dictionary methods, but run faster than PPM implementations, we describe methods for text compression using multiple dictionaries, one for each context of preceding characters, where the contexts have varying lengths. The context to be used is determined using an escape mechanism similar to that of PPM methods. We describe modifications of three popular dictionary coders along these lines and experiments evaluating their efficacy using the text files in the Calgary corpus. Our results suggest that modifying LZ77 along these lines yields an improvement in compression of about 4%, that modifying LZFG yields a compression improvement of about 8%, and that modifying LZW in this manner yields an average improvement on the order of 12%
Keywords :
data compression; encoding; Calgary corpus; escape mechanism; multiple dictionaries; multiple-dictionary compression; partial matching; text compressors; text files; Arithmetic; Compressors; Computer science; Dictionaries; Encoding; Entropy; Probability distribution; Statistical analysis; Statistics; Yield estimation;
Conference_Titel :
Data Compression Conference, 1995. DCC '95. Proceedings
Conference_Location :
Snowbird, UT
Print_ISBN :
0-8186-7012-6
DOI :
10.1109/DCC.1995.515517