• DocumentCode
    1120231
  • Title

    Efficiently Querying Large XML Data Repositories: A Survey

  • Author

    Gou, Gang ; Chirkova, Rada

  • Author_Institution
    North Carolina State Univ., Raleigh
  • Volume
    19
  • Issue
    10
  • fYear
    2007
  • Firstpage
    1381
  • Lastpage
    1403
  • Abstract
    Extensible markup language (XML) is emerging as a de facto standard for information exchange among various applications on the World Wide Web. There has been a growing need for developing high-performance techniques to query large XML data repositories efficiently. One important problem in XML query processing is twig pattern matching, that is, finding in an XML data tree D all matches that satisfy a specified twig (or path) query pattern Q. In this survey, we review, classify, and compare major techniques for twig pattern matching. Specifically, we consider two classes of major XML query processing techniques: the relational approach and the native approach. The relational approach directly utilizes existing relational database systems to store and query XML data, which enables the use of all important techniques that have been developed for relational databases, whereas in the native approach, specialized storage and query processing systems tailored for XML data are developed from scratch to further improve XML query performance. As implied by existing work, XML data querying and management are developing in the direction of integrating the relational approach with the native approach, which could result in higher query processing performance and also significantly reduce system reengineering costs.
  • Keywords
    XML; pattern matching; query processing; relational databases; very large databases; World Wide Web; data tree; extensible markup language; information exchange; large XML data repository; native approach; query processing; relational database system; twig pattern matching; Books; Data models; HTML; Markup languages; Organizing; Pattern matching; Query processing; Relational databases; Web sites; XML; XML query processing; twig pattern matching;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2007.1060
  • Filename
    4302745