Title :
Dynamic Markov Compression Using a Crossbar-Like Tree Initial Structure for Chinese Texts
Author :
Ong, Ghim-Hwee ; Ng, Jun-Ping
Author_Institution :
Dept. of Comput. Sci., Nat. Univ. of Singapore
Abstract :
This paper proposes the use of a crossbar-like tree structure to use with dynamic Markov compression (DMC) for the compression of Chinese text files. DMC had previously been found to be more effective than common compression techniques like compress and pack and gives a compression gain of between 13.1% and 32.0%. This initial structure is able to improve on DMC´s compression results, and outperforms the various initial structures commonly adopted, such as the single-state, linear, tree or braid structures by a gain ranging from 1.5% to 9.6%
Keywords :
Markov processes; data compression; natural languages; text analysis; tree data structures; Chinese text file; crossbar-like tree initial structure; dynamic Markov compression; Arithmetic; Binary trees; Cloning; Computer science; Encoding; Information technology; Predictive models; Probability distribution; Tree data structures;
Conference_Titel :
Information Technology and Applications, 2005. ICITA 2005. Third International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
0-7695-2316-1
DOI :
10.1109/ICITA.2005.119