• DocumentCode
    2847076
  • Title
    Adaptive Processing of Top-k Queries in XML
  • Author
    Marian, Amélie ; Amer-Yahia, Sihem ; Koudas, Nick ; Srivastava, Divesh
  • Author_Institution
    Columbia Univ., NY, USA
  • fYear
    2005
  • fDate
    05-08 April 2005
  • Firstpage
    162
  • Lastpage
    173
  • Abstract
    The ability to compute top-k matches to XML queries is gaining importance due to the increasing number of large XML repositories. The efficiency of top-k query evaluation relies on using scores to prune irrelevant answers as early as possible in the evaluation process. In this context, evaluating the same query plan for all answers might be too rigid because, at any time in the evaluation, answers have gone through the same number and sequence of operations, which limits the speed at which scores grow. Therefore, adaptive query processing that permits different plans for different partial matches and maximizes the best scores is more appropriate. In this paper, we propose an architecture and adaptive algorithms for efficiently computing top-k matches to XML queries. Our techniques can be used to evaluate both exact and approximate matches where approximation is defined by relaxing XPath axes. In order to compute the scores of query answers, we extend the traditional tf*idf measure to account for document structure. We conduct extensive experiments on a variety of benchmark data and queries, and demonstrate the usefulness of the adaptive approach for computing top-k queries in XML.
  • Keywords
    XML; query processing; XML queries; adaptive query processing; top-k queries; Adaptive algorithm; Books; Computer architecture; Frequency measurement; Information retrieval; Joining processes; Libraries
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Engineering, 2005. ICDE 2005. Proceedings. 21st International Conference on
  • ISSN
    1084-4627
  • Print_ISBN
    0-7695-2285-8
  • Type
    conf
  • DOI
    10.1109/ICDE.2005.18
  • Filename
    1410117