Title :
CGT Code-Based XML Data Compression Method
Author :
Zhang, Shen ; Chen, Sha ; Liang, Yuping
Author_Institution :
Sch. of Comput. Sci. & Technol., Nanchang Hangkong Univ., Nanchang, China
Abstract :
XML is a de-facto standard for exchanging and presenting information on the Web. However, XML data is also recognized as verbose since it heavily inflates the size of the data due to the repeated tags and structures. The data verbosity problem gives rise to many challenges of conventional query processing and data exchange. Compression techniques are the important way to overcome the verbosity problem. According to the features of XML document, we put forward a new XML data compression method called CGTXDC which uses XML Schema to construct XML document tree about the structure information of XML document and adopts CGT code to encode each tree node for maintaining the structure of the original XML document. CGTXDC requires only a single pass over the input XML document during the compression process and don´t need to build the document tree in the memory. The experimental results show much better compression ratio than that of representative XML compression methods, such as Xpress and Xgrind.
Keywords :
Internet; XML; data compression; electronic data interchange; XML data compression method; XML document; Xgrind; Xpress; data exchange; data verbosity problem; query processing; Asset management; Code standards; Computer science; Computer security; Data compression; Data security; Electronic commerce; Information security; Query processing; XML; CGT code; XML Schema; XML document tree; data compression;
Conference_Titel :
Electronic Commerce and Security, 2009. ISECS '09. Second International Symposium on
Conference_Location :
Nanchang
Print_ISBN :
978-0-7695-3643-9
DOI :
10.1109/ISECS.2009.128