DocumentCode
3012969
Title
Selectivity estimation for spatial joins
Author
An, Ning ; Yang, Zhen-Yu ; Sivasubramaniam, Anand
Author_Institution
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
fYear
2001
fDate
2001
Firstpage
368
Lastpage
375
Abstract
Spatial joins are important and time consuming operations in spatial database management systems. It is crucial to be able to accurately estimate the performance of these operations so that one can derive efficient query execution plans, and even develop/refine data structures to improve their performance. While estimation techniques for analyzing the performance of other operations, such as range queries, on spatial data has come under scrutiny, the problem of estimating selectivity for spatial joins has been little explored. The limited forays into this area have used parametric techniques, which are largely restrictive on the datasets that they can be used for since they tend to make simplifying assumptions about the nature of the datasets to be joined. Sampling and histogram based techniques, on the other hand, are much less restrictive. However, there has been no prior attempt at understanding the accuracy of sampling techniques, or developing histogram based techniques to estimate the selectivity of spatial joins. Apart from extensively evaluating the accuracy of sampling techniques for the very first time, this paper presents two novel histogram based solutions for spatial join estimation. Using a wide spectrum of both real and synthetic datasets, it is shown that one of our proposed schemes, called Geometric Histograms (GH), can accurately quantify the selectivity of spatial joins
Keywords
query processing; software performance evaluation; spatial data structures; visual databases; Geometric Histograms; data structures; histogram based techniques; performance; query execution plans; range queries; sampling techniques; selectivity estimation; spatial database; spatial joins; Computer science; Costs; Data engineering; Data structures; Database systems; Filters; Histograms; Nearest neighbor searches; Road transportation; Sampling methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2001. Proceedings. 17th International Conference on
Conference_Location
Heidelberg
ISSN
1063-6382
Print_ISBN
0-7695-1001-9
Type
conf
DOI
10.1109/ICDE.2001.914849
Filename
914849
Link To Document