Title :
Predicate-based Filtering of XPath Expressions
Author :
Hou, Shuang ; Jacobsen, Hans-Arno
Author_Institution :
University of Toronto, Canada
Abstract :
The XML/XPath filtering problem has attracted widespread interest. In this paper, we propose a novel algorithm for solving it. Our approach encodes XPath expressions (XPEs) as ordered sets of predicates and translates XML documents into sets of tuples, which are evaluated against these predicates. Predicates representing overlapping portions of XPEs are stored and processed only once, thus fully exploiting potential overlap among XPEs. We experimentally evaluate the performance of our algorithm, demonstrating its scalability to millions of XPEs, with matching times in the millisecond range. We also show interesting trade-offs compared with alternative approaches.
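The idea of predicate sharing can be illustrated with a minimal Python sketch. It is not the algorithm from the paper; it is a simplified illustration that assumes absolute, child-axis-only expressions such as `/a/b/c` with an optional `*` wildcard step. The names `PredicateIndex`, `iter_paths`, `pred_ids`, and the `(depth, tag)` predicate form are hypothetical and chosen only for this example. Each distinct predicate is stored once and evaluated once per document path, however many expressions contain it.

```python
import xml.etree.ElementTree as ET


def iter_paths(elem, prefix=()):
    """Yield the root-to-node tag path of every element in the tree."""
    path = prefix + (elem.tag,)
    yield path
    for child in elem:
        yield from iter_paths(child, path)


class PredicateIndex:
    """Hypothetical index: each XPE becomes an ordered tuple of shared predicate ids."""

    def __init__(self):
        self.pred_ids = {}  # (depth, tag) -> predicate id, one entry per distinct predicate
        self.xpes = {}      # expression string -> ordered tuple of predicate ids

    def add(self, xpe):
        # Assumption: only absolute child-axis paths, e.g. "/a/b" or "/a/*/d".
        steps = xpe.strip("/").split("/")
        ids = []
        for depth, tag in enumerate(steps, start=1):
            # setdefault reuses the id when another XPE already registered this predicate.
            ids.append(self.pred_ids.setdefault((depth, tag), len(self.pred_ids)))
        self.xpes[xpe] = tuple(ids)

    def match(self, xml_text):
        root = ET.fromstring(xml_text)
        matched = set()
        for path in iter_paths(root):
            # The document path as a set of (depth, tag) tuples.
            tuples = set(enumerate(path, start=1))
            # Evaluate every distinct predicate once for this path.
            true_ids = set()
            for (depth, tag), pid in self.pred_ids.items():
                if (depth, tag) in tuples or (tag == "*" and depth <= len(path)):
                    true_ids.add(pid)
            # An XPE matches when all of its predicates hold on the same path.
            for xpe, ids in self.xpes.items():
                if xpe not in matched and true_ids.issuperset(ids):
                    matched.add(xpe)
        return matched


index = PredicateIndex()
for expression in ["/a/b", "/a/b/c", "/a/*/d", "/a/x"]:
    index.add(expression)

result = index.match("<a><b><c/></b><e><d/></e></a>")
```

In this sketch the four expressions contain ten steps in total but only six distinct predicates, so the overlapping prefix `/a` and the shared step `/a/b` are each stored and tested once. Descendant axes, attribute tests, and the ordering constraints a full XPath filter needs are deliberately left out.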
Keywords :
Algorithm design and analysis; Automata; Engines; Filtering algorithms; Filters; Information filtering; Jacobian matrices; Routing; Scalability; XML
Conference_Title :
Data Engineering, 2006. ICDE '06. Proceedings of the 22nd International Conference on
Print_ISBN :
0-7695-2570-9
DOI :
10.1109/ICDE.2006.115