DocumentCode
3482095
Title
XQPoint: A queriable homomorphic XML compressor
Author
Al-Hamadani, Baydaa T. ; Alwan, Raad F. ; Lu, Joan
Author_Institution
Huddersfield Univ., Huddersfield, UK
fYear
2009
fDate
15-17 Dec. 2009
Firstpage
95
Lastpage
99
Abstract
XML has becoming the standard way for representing and transforming data over the World Wide Web. The annoying problem with XML documents is that they have a very high ratio of redundancy, which makes these documents storage demanding and require a large network band-width for transmission. To remedy this problem, a lot of approaches had been conducted in order to compress XML documents. Some of these approaches supply querying the compressed documents, while others compress the XML documents for archival purposes. In this paper we propose a new XML compression technique that obeys the structure of the XML documents and provides the ability to querying the compressed document with both content and structure (CAS) queries type. XML elements and attributes names are encoded by using fixed-point dictionary-based technique. Other XML data are organized into special containers according to their path from the root attribute, and the containers are compressed using the same fixed-point technique. Using different types of XML documents and different styles of user queries, the XQPoint has been experimented to test its effectiveness in both the compression ratio and the querying performance.
Keywords
Internet; XML; content-based retrieval; data compression; data structures; information retrieval systems; World Wide Web; XML compression; XQPoint; archival purpose; content and structure queries; data representation; data transformation; documents storage; fixed-point dictionary; queriable homomorphic XML compressor; Arithmetic; Containers; Content addressable storage; Encoding; Huffman coding; Information retrieval; Testing; Text analysis; Web sites; XML; XML; compression;
fLanguage
English
Publisher
ieee
Conference_Titel
Innovations in Information Technology, 2009. IIT '09. International Conference on
Conference_Location
Al Ain
Print_ISBN
978-1-4244-5698-7
Type
conf
DOI
10.1109/IIT.2009.5413789
Filename
5413789
Link To Document