• DocumentCode
    694402
  • Title

    A novel KNN join algorithms based on Hilbert R-tree in MapReduce

  • Author

    Qinsheng Du ; Xiongfei Li

  • Author_Institution
    Coll. of Comput. Sci. & Technol., Jilin Univ., Changchun, China
  • fYear
    2013
  • fDate
    12-13 Oct. 2013
  • Firstpage
    417
  • Lastpage
    420
  • Abstract
    The kNN join (k nearest neighbor join) is a primitive operation widely adopted by many data mining applications. From a dataset S, it can find k nearest neighbors for every object in another dataset R. It is a combination of the k nearest neighbor query and the join operation. In this paper we perform kNN join in MapReduce. For kNN joins, the block nested loop methodology is the direct method. It is very efficient to load a R-tree and search kNN in R-tree. On the base, we build a Hilbert R-tree index for the local S block in a bucket. It can help us find kNNs in the same bucket. Extensive experiments demonstrate that our proposed methods are efficient.
  • Keywords
    indexing; query processing; tree data structures; Hilbert R-tree index; MapReduce; block nested loop methodology; data mining applications; join operation; k nearest neighbor join; k nearest neighbor query; kNN join algorithms; Algorithm design and analysis; Clustering algorithms; Computer science; Data mining; Educational institutions; Partitioning algorithms; Vectors; Hilbert R-tree; K-Nearest Neighbor; R-tree; Space-filling curve;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Network Technology (ICCSNT), 2013 3rd International Conference on
  • Conference_Location
    Dalian
  • Type

    conf

  • DOI
    10.1109/ICCSNT.2013.6967143
  • Filename
    6967143