Title :
XQPoint: A queriable homomorphic XML compressor
Author :
Al-Hamadani, Baydaa T. ; Alwan, Raad F. ; Lu, Joan
Author_Institution :
Huddersfield Univ., Huddersfield, UK
Abstract :
XML has becoming the standard way for representing and transforming data over the World Wide Web. The annoying problem with XML documents is that they have a very high ratio of redundancy, which makes these documents storage demanding and require a large network band-width for transmission. To remedy this problem, a lot of approaches had been conducted in order to compress XML documents. Some of these approaches supply querying the compressed documents, while others compress the XML documents for archival purposes. In this paper we propose a new XML compression technique that obeys the structure of the XML documents and provides the ability to querying the compressed document with both content and structure (CAS) queries type. XML elements and attributes names are encoded by using fixed-point dictionary-based technique. Other XML data are organized into special containers according to their path from the root attribute, and the containers are compressed using the same fixed-point technique. Using different types of XML documents and different styles of user queries, the XQPoint has been experimented to test its effectiveness in both the compression ratio and the querying performance.
Keywords :
Internet; XML; content-based retrieval; data compression; data structures; information retrieval systems; World Wide Web; XML compression; XQPoint; archival purpose; content and structure queries; data representation; data transformation; documents storage; fixed-point dictionary; queriable homomorphic XML compressor; Arithmetic; Containers; Content addressable storage; Encoding; Huffman coding; Information retrieval; Testing; Text analysis; Web sites; XML; XML; compression;
Conference_Titel :
Innovations in Information Technology, 2009. IIT '09. International Conference on
Conference_Location :
Al Ain
Print_ISBN :
978-1-4244-5698-7
DOI :
10.1109/IIT.2009.5413789