DocumentCode
2630167
Title
Optimizing regular path expressions using graph schemas
Author
Fernández, Mary ; Suciu, Dan
Author_Institution
AT & T Labs., Florham Park, NJ, USA
fYear
1998
fDate
23-27 Feb 1998
Firstpage
14
Lastpage
23
Abstract
Query languages for data with irregular structure use regular path expressions for navigation. This feature is useful for querying data where parts of the structure is either unknown, unavailable to the user, or changes frequently. Naive execution of regular path expressions is inefficient however, because it ignores any structure in the data. We describe two optimization techniques for queries with regular path expressions. Both rely on graph schemas for specifying partial knowledge about the data´s structure. Query pruning uses this structure to restrict navigation to only a fragment of the data; we give an efficient algorithm for rewriting any regular path expression query into a pruned one. Query rewriting using state extents can eliminate or reduce navigation altogether; it is reminiscent of optimizing relational queries using indices. There may be several ways to optimize a query using state extents; we give a polynomial space algorithm that finds all such optimizations. For restricted forms of regular path expressions, the algorithm is provably efficient. We also give an efficient approximation algorithm that works on all regular path expressions
Keywords
data structures; formal specification; graph theory; query languages; query processing; rewriting systems; approximation algorithm; data querying; graph schemas; irregular structure; naive execution; optimization techniques; partial knowledge; polynomial space algorithm; query languages; query pruning; query rewriting; regular path expression optimization; state extents; Approximation algorithms; Biological system modeling; Data models; Data structures; Data systems; Database languages; Navigation; Polynomials; Relational databases; Tree graphs;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 1998. Proceedings., 14th International Conference on
Conference_Location
Orlando, FL
ISSN
1063-6382
Print_ISBN
0-8186-8289-2
Type
conf
DOI
10.1109/ICDE.1998.655753
Filename
655753
Link To Document