Title :
Merging prediction by partial matching with structural contexts model
Author :
Adíego, Joaquin ; de la Puente, P. ; Navarro, Gonzalo
Author_Institution :
Dpto. de Informatica, Valladolid Univ., Spain
Abstract :
This paper discusses the possibility of considering the text structure in the context of compressed structured documents. This paper also proposes a compression technique for structured documents, called SCMPPM, which combines the prediction by partial matching technique with structural contexts model idea, which takes advantage of the context information usually implicit in the structure of the text. The experimental results shows significant gains over the methods that are insensitive to the structure and over the current methods that consider the structure. This method actually improves compression ratios with respect to the basic SCM technique.
Keywords :
Huffman codes; XML; data compression; text analysis; compression ratio; context compression; context information; prediction partial matching; structural context model; structured document; text structure; Context modeling; Merging;
Conference_Titel :
Data Compression Conference, 2004. Proceedings. DCC 2004
Print_ISBN :
0-7695-2082-0
DOI :
10.1109/DCC.2004.1281498