• DocumentCode
    4602
  • Title

    Scalable Distributed Processing of K Nearest Neighbor Queries over Moving Objects

  • Author

    Ziqiang Yu ; Yang Liu ; Xiaohui Yu ; Pu, Ken Q.

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
  • Volume
    27
  • Issue
    5
  • fYear
    2015
  • fDate
    May 1 2015
  • Firstpage
    1383
  • Lastpage
    1396
  • Abstract
    Central to many applications involving moving objects is the task of processing k-nearest neighbor (k-NN) queries. Most of the existing approaches to this problem are designed for the centralized setting where query processing takes place on a single server; it is difficult, if not impossible, for them to scale to a distributed setting to handle the vast volume of data and concurrent queries that are increasingly common in those applications. To address this problem, we propose a suite of solutions that can support scalable distributed processing of k-NN queries. We first present a new index structure called Dynamic Strip Index (DSI), which can better adapt to different data distributions than exiting grid indexes. Moreover, it can be naturally distributed across the cluster, therefore lending itself well to distributed processing. We further propose a distributed k-NN search (DKNN) algorithm based on DSI. DKNN avoids having an uncertain number of potentially expensive iterations, and is thus more efficient and more predictable than existing approaches. DSI and DKNN are implemented on Apache S4, an open-source platform for distributed stream processing. We perform extensive experiments to study the characteristics of DSI and DKNN, and compare them with three baseline methods. Experimental results show that our proposal scales well and significantly outperforms the alternative methods.
  • Keywords
    data handling; distributed processing; query processing; Apache S4 open-source platform; DKNN algorithm; DSI; distributed k-NN search algorithm; distributed stream processing; dynamic strip index; k-NN query processing; k-nearest neighbor query processing; moving objects; scalable distributed processing; Educational institutions; Indexes; Query processing; Search problems; Servers; Silicon; Strips; distributed query processing; k nearest neighbor query; moving objects;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2014.2364046
  • Filename
    6930803