Title :
Efficient creation and incremental maintenance of the HOPI index for complex XML document collections
Author :
Schenkel, Ralf ; Theobald, Anja ; Weikum, Gerhard
Author_Institution :
Max-Planck-Institut für Informatik, Saarbrücken, Germany
Abstract :
The HOPI index, a connection index for XML documents based on the concept of a 2-hop cover, provides space- and time-efficient reachability tests along the ancestor, descendant, and link axes to support path expressions with wildcards in XML search engines. This paper presents enhanced algorithms for building HOPI, shows how to augment the index with distance information, and discusses incremental index maintenance. Our experiments show substantial improvements over the existing divide-and-conquer algorithm for index creation, low space overhead for including distance information in the index, and efficient updates.
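As an illustration of the 2-hop cover idea the abstract refers to, here is a minimal sketch in Python. It is not the paper's algorithm or data layout. It assumes each node stores two label sets of "center" nodes with distances (a node is reachable from another when the two share a center), and that every node lists itself as a center at distance 0. The names `l_out`, `l_in`, `distance`, and `reachable` are hypothetical.

```python
from typing import Dict, Optional

# Hypothetical label layout: node -> {center: distance}.
# l_out[u] holds centers reachable from u; l_in[v] holds centers that reach v.
# Assumption: every node lists itself as a center at distance 0.
Labels = Dict[str, Dict[str, int]]


def distance(l_out: Labels, l_in: Labels, u: str, v: str) -> Optional[int]:
    """Return the shortest u -> v distance via a shared center, or None."""
    out_u = l_out.get(u, {})
    in_v = l_in.get(v, {})
    common = out_u.keys() & in_v.keys()
    if not common:
        return None  # no shared center: v is not reachable from u
    return min(out_u[w] + in_v[w] for w in common)


def reachable(l_out: Labels, l_in: Labels, u: str, v: str) -> bool:
    """Reachability test: do the two label sets share any center?"""
    return distance(l_out, l_in, u, v) is not None


# Toy chain a -> b -> c, covered by the single center b.
l_out: Labels = {"a": {"a": 0, "b": 1}, "b": {"b": 0}, "c": {"c": 0}}
l_in: Labels = {"a": {"a": 0}, "b": {"b": 0}, "c": {"c": 0, "b": 1}}
```

In this toy example, `reachable(l_out, l_in, "a", "c")` holds because both labels contain the center `b`, and the stored distances add up to the path length 2. The reverse query finds no shared center.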
Keywords :
XML; data structures; database indexing; query processing; reachability analysis; search engines; HOPI incremental index maintenance; HOPI index creation; XML document collection; XML search engine; divide-and-conquer algorithm; path expression; space-efficient reachability test; time-efficient reachability test; Availability; Data engineering; Encoding; Intelligent structures; Interference; Portals; Testing; Tree graphs
Conference_Title :
Proceedings of the 21st International Conference on Data Engineering (ICDE 2005)
Print_ISBN :
0-7695-2285-8
DOI :
10.1109/ICDE.2005.57