• DocumentCode
    2630167
  • Title

    Optimizing regular path expressions using graph schemas

  • Author

    Fernández, Mary ; Suciu, Dan

  • Author_Institution
    AT & T Labs., Florham Park, NJ, USA
  • fYear
    1998
  • fDate
    23-27 Feb 1998
  • Firstpage
    14
  • Lastpage
    23
  • Abstract
    Query languages for data with irregular structure use regular path expressions for navigation. This feature is useful for querying data where parts of the structure is either unknown, unavailable to the user, or changes frequently. Naive execution of regular path expressions is inefficient however, because it ignores any structure in the data. We describe two optimization techniques for queries with regular path expressions. Both rely on graph schemas for specifying partial knowledge about the data´s structure. Query pruning uses this structure to restrict navigation to only a fragment of the data; we give an efficient algorithm for rewriting any regular path expression query into a pruned one. Query rewriting using state extents can eliminate or reduce navigation altogether; it is reminiscent of optimizing relational queries using indices. There may be several ways to optimize a query using state extents; we give a polynomial space algorithm that finds all such optimizations. For restricted forms of regular path expressions, the algorithm is provably efficient. We also give an efficient approximation algorithm that works on all regular path expressions
  • Keywords
    data structures; formal specification; graph theory; query languages; query processing; rewriting systems; approximation algorithm; data querying; graph schemas; irregular structure; naive execution; optimization techniques; partial knowledge; polynomial space algorithm; query languages; query pruning; query rewriting; regular path expression optimization; state extents; Approximation algorithms; Biological system modeling; Data models; Data structures; Data systems; Database languages; Navigation; Polynomials; Relational databases; Tree graphs;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 1998. Proceedings., 14th International Conference on
  • Conference_Location
    Orlando, FL
  • ISSN
    1063-6382
  • Print_ISBN
    0-8186-8289-2
  • Type

    conf

  • DOI
    10.1109/ICDE.1998.655753
  • Filename
    655753