DocumentCode
694402
Title
A novel KNN join algorithms based on Hilbert R-tree in MapReduce
Author
Qinsheng Du ; Xiongfei Li
Author_Institution
Coll. of Comput. Sci. & Technol., Jilin Univ., Changchun, China
fYear
2013
fDate
12-13 Oct. 2013
Firstpage
417
Lastpage
420
Abstract
The kNN join (k nearest neighbor join) is a primitive operation widely adopted by many data mining applications. From a dataset S, it can find k nearest neighbors for every object in another dataset R. It is a combination of the k nearest neighbor query and the join operation. In this paper we perform kNN join in MapReduce. For kNN joins, the block nested loop methodology is the direct method. It is very efficient to load a R-tree and search kNN in R-tree. On the base, we build a Hilbert R-tree index for the local S block in a bucket. It can help us find kNNs in the same bucket. Extensive experiments demonstrate that our proposed methods are efficient.
Keywords
indexing; query processing; tree data structures; Hilbert R-tree index; MapReduce; block nested loop methodology; data mining applications; join operation; k nearest neighbor join; k nearest neighbor query; kNN join algorithms; Algorithm design and analysis; Clustering algorithms; Computer science; Data mining; Educational institutions; Partitioning algorithms; Vectors; Hilbert R-tree; K-Nearest Neighbor; R-tree; Space-filling curve;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Network Technology (ICCSNT), 2013 3rd International Conference on
Conference_Location
Dalian
Type
conf
DOI
10.1109/ICCSNT.2013.6967143
Filename
6967143
Link To Document