Title :
On the integration of structure indexes and inverted lists
Author :
Kaushik, Raghav ; Krishnamurthy, Rajasekar ; Naughton, Jeffrey F. ; Ramakrishnan, Raghu
Author_Institution :
Wisconsin Univ., WI, USA
Date :
30 March-2 April 2004
Abstract :
Recently, there has been a great deal of interest in techniques for evaluating path expressions over collections of XML documents. In general, these path expressions contain both structural and keyword components. Several methods have been proposed for processing path expressions over graph- or tree-structured XML data, and they fall into two broad classes. The first involves graph traversal, where the input query is evaluated by traversing the data graph or some compressed representation of it. The second involves information-retrieval style processing using inverted lists. In this framework, structure indexes have been proposed as a substitute for graph traversal. Here, we focus on a subclass of content-and-structure (CAS) queries consisting of simple path expressions. We study algorithmic issues in integrating structure indexes with inverted lists for the evaluation of these queries, where we rank all documents that match the query and return the top k documents in order of relevance.
Keywords :
XML; database indexing; list processing; query formulation; query processing; CAS query; XML document; graph traversal; graph-structured XML data; information-retrieval style processing; inverted list; path expression processing; structure index; tree-structured XML data; Bridges; Content addressable storage; Indexing; Information retrieval; Keyword search; Middleware; Proposals; Tree graphs
Conference_Title :
Data Engineering, 2004. Proceedings. 20th International Conference on
Print_ISBN :
0-7695-2065-0
DOI :
10.1109/ICDE.2004.1320060