• DocumentCode
    3012969
  • Title

    Selectivity estimation for spatial joins

  • Author

    An, Ning ; Yang, Zhen-Yu ; Sivasubramaniam, Anand

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    368
  • Lastpage
    375
  • Abstract
    Spatial joins are important and time consuming operations in spatial database management systems. It is crucial to be able to accurately estimate the performance of these operations so that one can derive efficient query execution plans, and even develop/refine data structures to improve their performance. While estimation techniques for analyzing the performance of other operations, such as range queries, on spatial data has come under scrutiny, the problem of estimating selectivity for spatial joins has been little explored. The limited forays into this area have used parametric techniques, which are largely restrictive on the datasets that they can be used for since they tend to make simplifying assumptions about the nature of the datasets to be joined. Sampling and histogram based techniques, on the other hand, are much less restrictive. However, there has been no prior attempt at understanding the accuracy of sampling techniques, or developing histogram based techniques to estimate the selectivity of spatial joins. Apart from extensively evaluating the accuracy of sampling techniques for the very first time, this paper presents two novel histogram based solutions for spatial join estimation. Using a wide spectrum of both real and synthetic datasets, it is shown that one of our proposed schemes, called Geometric Histograms (GH), can accurately quantify the selectivity of spatial joins
  • Keywords
    query processing; software performance evaluation; spatial data structures; visual databases; Geometric Histograms; data structures; histogram based techniques; performance; query execution plans; range queries; sampling techniques; selectivity estimation; spatial database; spatial joins; Computer science; Costs; Data engineering; Data structures; Database systems; Filters; Histograms; Nearest neighbor searches; Road transportation; Sampling methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2001. Proceedings. 17th International Conference on
  • Conference_Location
    Heidelberg
  • ISSN
    1063-6382
  • Print_ISBN
    0-7695-1001-9
  • Type

    conf

  • DOI
    10.1109/ICDE.2001.914849
  • Filename
    914849