Title :
Down the Rabbit Hole: Robust Proximity Search and Density Estimation in Sublinear Space
Author :
Har-Peled, Sariel ; Kumar, Nirman
Author_Institution :
Dept. of Comput. Sci., Univ. of Illinois, Urbana, IL, USA
Abstract :
For a set of n points in Rd, and parameters k and e, we present a data structure that answers (1 + e)-approximate k nearest neighbor queries in logarithmic time. Surprisingly, the space used by the data-structure is Õ(n/k), that is, the space used is sub linear in the input size if k is sufficiently large. Our approach provides a novel way to summarize geometric data, such that meaningful proximity queries on the data can be carried out using this sketch. Using this we provide a sub linear space data-structure that can estimate the density of a point set under various measures, including: (i) sum of distances of k closest points to the query point, and (ii) sum of squared distances of k closest points to the query point. Our approach generalizes to other distance based estimation of densities of similar flavor.
Keywords :
approximation theory; computational complexity; computational geometry; data structures; pattern classification; query processing; set theory; geometric data summarization; k closest points sum-of-distances; k closest points sum-of-squared distances; k nearest neighbor queries; logarithmic time; point set density estimation; proximity queries; rabbit hole; robust proximity search; sublinear space data-structure; Approximation algorithms; Approximation methods; Artificial neural networks; Clustering algorithms; Complexity theory; Estimation; Standards;
Conference_Titel :
Foundations of Computer Science (FOCS), 2012 IEEE 53rd Annual Symposium on
Conference_Location :
New Brunswick, NJ
Print_ISBN :
978-1-4673-4383-1
DOI :
10.1109/FOCS.2012.31