• DocumentCode
    2528876
  • Title

    Multi-query Optimization for Distributed Similarity Query Processing

  • Author

    Zhuang, Yi ; Li, Qing ; Chen, Lei

  • Author_Institution
    Zhejiang Univ., Hangzhou
  • fYear
    2008
  • fDate
    17-20 June 2008
  • Firstpage
    639
  • Lastpage
    646
  • Abstract
    This paper considers a multi-query optimization issue for distributed similarity query processing, which attempts to exploit the dependencies in the derivation of a query evaluation plan. To the best of our knowledge, this is the first work investigating a multi- query optimization technique for distributed similarity query processing (MDSQ). Four steps are incorporated in our MDSQ algorithm. First when a number of query requests(i.e., m query vectors and m radiuses) are simultaneously submitted by users, then a cost-based dynamic query scheduling(DQS) procedure is invoked to quickly and effectively identify the correlation among the query spheres (requests). After that, an index-based vector set reduction is performed at data node level in parallel. Finally, a refinement process of the candidate vectors is conducted to get the answer set. The proposed method includes a cost-based dynamic query scheduling, a Start-Distance(SD)-based load balancing scheme, and an index-based vector set reduction algorithm. The experimental results validate the efficiency and effectiveness of the algorithm in minimizing the response time and increasing the parallelism of I/O and CPU.
  • Keywords
    database indexing; distributed databases; query processing; resource allocation; scheduling; CPU parallelism; I/O parallelism; candidate vector refinement process; cost-based dynamic query scheduling; data node level; distributed similarity query processing; index-based vector set reduction; multiquery optimization; query evaluation; query request; query spheres; query vectors; response time minimization; start-distance-based load balancing; Computer science; Costs; Delay; Dynamic scheduling; Indexing; Load management; Neodymium; Parallel processing; Processor scheduling; Query processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Distributed Computing Systems, 2008. ICDCS '08. The 28th International Conference on
  • Conference_Location
    Beijing
  • ISSN
    1063-6927
  • Print_ISBN
    978-0-7695-3172-4
  • Electronic_ISBN
    1063-6927
  • Type

    conf

  • DOI
    10.1109/ICDCS.2008.58
  • Filename
    4595937