Title :
Selectivity estimation for spatial joins
Author :
An, Ning ; Yang, Zhen-Yu ; Sivasubramaniam, Anand
Author_Institution :
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
Abstract :
Spatial joins are important and time consuming operations in spatial database management systems. It is crucial to be able to accurately estimate the performance of these operations so that one can derive efficient query execution plans, and even develop/refine data structures to improve their performance. While estimation techniques for analyzing the performance of other operations, such as range queries, on spatial data has come under scrutiny, the problem of estimating selectivity for spatial joins has been little explored. The limited forays into this area have used parametric techniques, which are largely restrictive on the datasets that they can be used for since they tend to make simplifying assumptions about the nature of the datasets to be joined. Sampling and histogram based techniques, on the other hand, are much less restrictive. However, there has been no prior attempt at understanding the accuracy of sampling techniques, or developing histogram based techniques to estimate the selectivity of spatial joins. Apart from extensively evaluating the accuracy of sampling techniques for the very first time, this paper presents two novel histogram based solutions for spatial join estimation. Using a wide spectrum of both real and synthetic datasets, it is shown that one of our proposed schemes, called Geometric Histograms (GH), can accurately quantify the selectivity of spatial joins
Keywords :
query processing; software performance evaluation; spatial data structures; visual databases; Geometric Histograms; data structures; histogram based techniques; performance; query execution plans; range queries; sampling techniques; selectivity estimation; spatial database; spatial joins; Computer science; Costs; Data engineering; Data structures; Database systems; Filters; Histograms; Nearest neighbor searches; Road transportation; Sampling methods;
Conference_Titel :
Data Engineering, 2001. Proceedings. 17th International Conference on
Conference_Location :
Heidelberg
Print_ISBN :
0-7695-1001-9
DOI :
10.1109/ICDE.2001.914849