DocumentCode :
694402
Title :
A novel KNN join algorithms based on Hilbert R-tree in MapReduce
Author :
Qinsheng Du ; Xiongfei Li
Author_Institution :
Coll. of Comput. Sci. & Technol., Jilin Univ., Changchun, China
fYear :
2013
fDate :
12-13 Oct. 2013
Firstpage :
417
Lastpage :
420
Abstract :
The kNN join (k nearest neighbor join) is a primitive operation widely adopted by many data mining applications. From a dataset S, it can find k nearest neighbors for every object in another dataset R. It is a combination of the k nearest neighbor query and the join operation. In this paper we perform kNN join in MapReduce. For kNN joins, the block nested loop methodology is the direct method. It is very efficient to load a R-tree and search kNN in R-tree. On the base, we build a Hilbert R-tree index for the local S block in a bucket. It can help us find kNNs in the same bucket. Extensive experiments demonstrate that our proposed methods are efficient.
Keywords :
indexing; query processing; tree data structures; Hilbert R-tree index; MapReduce; block nested loop methodology; data mining applications; join operation; k nearest neighbor join; k nearest neighbor query; kNN join algorithms; Algorithm design and analysis; Clustering algorithms; Computer science; Data mining; Educational institutions; Partitioning algorithms; Vectors; Hilbert R-tree; K-Nearest Neighbor; R-tree; Space-filling curve;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Network Technology (ICCSNT), 2013 3rd International Conference on
Conference_Location :
Dalian
Type :
conf
DOI :
10.1109/ICCSNT.2013.6967143
Filename :
6967143
Link To Document :
بازگشت