DocumentCode
130328
Title
Experimental evaluation of selected tree structures for exact and approximate k-nearest neighbor classification
Author
Cislak, Aleksander ; Grabowski, Szymon
Author_Institution
Dept. of Inf., Tech. Univ. of Munich, Garching, Germany
fYear
2014
fDate
7-10 Sept. 2014
Firstpage
93
Lastpage
100
Abstract
Spatial data structures, for vector or metric spaces, are a well-known means to speed-up proximity queries. One of the common uses of the found neighbors of the query object is in classification methods, e.g., the famous k-nearest neighbor algorithm. Still, most experimental works focus on providing attractive tradeoffs between neighbor search times and the neighborhood quality, but they ignore the impact of such tradeoffs on the classification accuracy. In this paper, we explore a few simple approximate and probabilistic variants of two popular spatial data structures, the k-d tree and the ball tree, with k-NN results on real data sets. The main difference between these two structures is the location of input data - in all nodes (k-d tree), or in the leaves (ball tree) - and for this reason they act as good representatives of other spatial structures. We show that in several cases significant speedups compared to the use of such structures in the exact k-NN classification are possible, with a moderate penalty in accuracy. We conclude that the usage of the k-d tree is a more promising approach.
Keywords
pattern classification; tree data structures; ball tree; k-NN classification; k-d tree; k-nearest neighbor classification; spatial data structures; tree structures;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Information Systems (FedCSIS), 2014 Federated Conference on
Conference_Location
Warsaw
Type
conf
DOI
10.15439/2014F194
Filename
6933001
Link To Document