Title :
Highly efficient universal coding with classifying to subdictionaries for text compression
Author :
Nakano, Yasuhiko ; Yahagi, Hironori ; Okada, Yoshiyuki ; Yoshida, Shigeru
Author_Institution :
Electron. Syst. Labs., Fujitsu Labs. Ltd., Atsugi, Japan
Abstract :
Describes a practical, locally adaptive data compression algorithm of the LZ78 class. Under the Lempel-Ziv incremental parsing rule, the boundary of a string is not related to the statistical history modeled by finite-state sources. The authors have previously reported an algorithm, classifying to subdictionaries (CSD), which uses multiple subdictionaries and conditions the current string on the previous one to obtain a higher compression ratio for image compression. They present a practical implementation of this method for any kind of data, and show that CSD is more efficient than LZC, the UNIX compress utility. With a practical dictionary size of 8K entries, the compression performance of CSD was about 10% better than that of LZC on test data from the Calgary Compression Corpus. Using hashing, the processing speed of CSD became as fast as that of LZC, even though the CSD algorithm is more complicated than LZC.
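The idea of conditioning the current string on the previous one can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how a subdictionary is selected, so the sketch assumes the context is the last byte of the previous phrase. The function names, the initial context of 0, and the unbounded dictionary size (the paper uses an 8K-entry dictionary) are all hypothetical choices made for illustration.

```python
def csd_like_encode(data: bytes):
    """LZ78-style parse with one subdictionary per context.

    Assumption: the context is the last byte of the previous phrase.
    Returns a list of (phrase_index, next_byte) tokens; next_byte is None
    only for a final phrase that ends exactly at the end of the input.
    """
    subdicts = {}   # context byte -> {phrase bytes: index}
    tokens = []
    ctx = 0         # hypothetical initial context
    i, n = 0, len(data)
    while i < n:
        d = subdicts.setdefault(ctx, {})
        phrase, idx = b"", 0          # index 0 denotes the empty phrase
        # Extend the match as long as the phrase exists in this subdictionary.
        while i < n and phrase + data[i:i + 1] in d:
            phrase += data[i:i + 1]
            idx = d[phrase]
            i += 1
        if i < n:
            sym = data[i]
            i += 1
            d[phrase + bytes([sym])] = len(d) + 1   # register the new phrase
            tokens.append((idx, sym))
            ctx = sym                 # the next phrase is conditioned on this byte
        else:
            tokens.append((idx, None))
    return tokens


def csd_like_decode(tokens):
    """Inverse of csd_like_encode, rebuilding the same subdictionaries."""
    subdicts = {}   # context byte -> list of phrases, index 0 is empty
    out = bytearray()
    ctx = 0
    for idx, sym in tokens:
        phrases = subdicts.setdefault(ctx, [b""])
        phrase = phrases[idx]
        if sym is not None:
            phrase += bytes([sym])
            phrases.append(phrase)
            ctx = sym
        out += phrase
    return bytes(out)
```

Because each subdictionary holds only the phrases seen after one context, phrase indices within a subdictionary stay small, which is one plausible reading of why conditioning can improve the compression ratio. A real coder would also bound the dictionary size and encode the tokens as variable-width codes.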
Keywords :
data compression; glossaries; image coding; word processing; Calgary Compression Corpus; Lempel-Ziv incremental parsing rule; UNIX utility; adaptive data compression algorithm; compression performance; compression ratio; finite-state sources; hashing; image compression; processing speed; subdictionaries; test data; text compression; universal coding; Compression algorithms; Data compression; Dictionaries; History; Holography; Image coding; Laboratories; Microcomputers; Statistical analysis; Testing;
Conference_Title :
Data Compression Conference, 1994. DCC '94. Proceedings
Conference_Location :
Snowbird, UT
Print_ISBN :
0-8186-5637-9
DOI :
10.1109/DCC.1994.305931