• DocumentCode
    2403597
  • Title

    Structural joins: a primitive for efficient XML query pattern matching

  • Author

    Al-Khalifa, Shurug ; Agadish, H. V J ; Koudas, Nick ; Patel, Jignesh M. ; Srivastava, Divesh ; Wu, YuqingWu

  • fYear
    2002
  • fDate
    2002
  • Firstpage
    141
  • Lastpage
    152
  • Abstract
    XML queries typically specify patterns of selection predicates on multiple elements that have some specified tree structured relationships. The primitive tree structured relationships are parent-child and ancestor-descendant, and finding all occurrences of these relationships in an XML database is a core operation for XML query processing. We develop two families of structural join algorithms for this task: tree-merge and stack-tree. The tree-merge algorithms are a natural extension of traditional merge joins and the multi-predicate merge joins, while the stack-tree algorithms have no counterpart in traditional relational join processing. We present experimental results on a range of data and queries using the TIMBER native XML query engine built on top of SHORE. We show that while, in some cases, tree-merge algorithms can have performance comparable to stack-tree algorithms, in many cases they are considerably worse. This behavior is explained by analytical results that demonstrate that, on sorted inputs, the stack-tree algorithms have worst-case I/O and CPU complexities linear in the sum of the sizes of inputs and output, while the tree-merge algorithms do not have the same guarantee
  • Keywords
    hypermedia markup languages; merging; pattern matching; query processing; relational algebra; relational databases; tree data structures; CPU complexity; SHORE; TIMBER; XML; experimental results; merge joins; multi-predicate merge joins; performance; query engine; query pattern matching; query processing; relational join processing; selection predicates; sorted inputs; stack-tree; structural join algorithms; tree data structures; tree-merge; Algorithm design and analysis; Books; Data engineering; Database languages; Database systems; Engines; Pattern matching; Query processing; Relational databases; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2002. Proceedings. 18th International Conference on
  • Conference_Location
    San Jose, CA
  • ISSN
    1063-6382
  • Print_ISBN
    0-7695-1531-2
  • Type

    conf

  • DOI
    10.1109/ICDE.2002.994704
  • Filename
    994704