Title :
XML Schema Based Compression Technology over XML Data Stream
Author :
Zhang, Xiaolin ; Zhai, Guofeng ; Tian, Rong
Author_Institution :
Inf. Eng. Coll., Inner Mongolia Univ. of Sci. & Technol., Baotou, China
Abstract :
As XML has become the standard of data exchange and express of Internet and e-commerce, it is being more and more widely used, while XML is a kind of self-described language, with a large number of redundant structural information, so that is the hot study now on how to use XML data stream to be reasonable and efficient. The existing compression techniques of XML require two-pass scan on data, which is not suitable for the data stream. In this paper, a new compression technology method is proposed to compress and decompress XML data stream, which first get the structure data through analyzing and parsing the XML schema, then encode it with dynamic Huffman encoding, and finally complete the compression and decompression of XML data stream on real time. Experimental results show that this algorithm in the data compression ratio and time of compression is superior to the traditional algorithm.
Keywords :
Huffman codes; XML; data compression; data structures; electronic data interchange; Internet; XML data stream compression; XML data stream decompression; XML schema parsing; data exchange; dynamic Huffman encoding; e-commerce; redundant structural information; selfdescribed language; structure data; two-pass scan; Compression algorithms; Costs; Data compression; Data engineering; Dictionaries; Educational institutions; Encoding; Frequency; Query processing; XML;
Conference_Titel :
Web Information Systems and Applications Conference, 2009. WISA 2009. Sixth
Conference_Location :
Xuzhou, Jiangsu
Print_ISBN :
978-0-7695-3874-7
DOI :
10.1109/WISA.2009.46