Title :
Optimizing path expression queries of XML data
Author :
Li, Yang ; Yi, Ping ; Li, Qiyan
Author_Institution :
CAD Res. Center, Tongji Univ., Shanghai
Abstract :
With the advent of XML as a standard for data representation and exchange on the Internet, storing and querying XML data has become increasingly important. This poses a new challenge for indexing and searching XML data, because conventional approaches based on the relational model may not meet the processing requirements of XML data. In this paper, we propose a path index based on Patricia-tries, called the PT index. The PT index structure offers several novel features. First, its structure based on Patricia-tries supports fast search. Second, the path indexes are compressed so that they can be stored in memory. Third, because the PT index includes both the structure and the text of the XML data, results can be obtained from the index without reading the original XML data from disk. We further analyze the time complexity and space complexity of the PT index. Experimental results from our prototype system implementation show that the PT index can outperform some representative index approaches, such as DataGuide, the B+ tree index, and Index Fabric.
Keywords :
XML; computational complexity; database indexing; query processing; tree data structures; tree searching; Internet; Patricia-tries; XML data; data exchange; data representation; path expression query optimization; path index; space complexity; time complexity; Data engineering; Data models; Database languages; Fabrics; Indexes; Information security; Navigation
Conference_Title :
e-Business Engineering, 2005. ICEBE 2005. IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7695-2430-3
DOI :
10.1109/ICEBE.2005.93