DocumentCode
1961853
Title
Analyzing range queries on spatial data
Author
Jin, Ji ; An, Ning ; Sivasubramaniam, Anand
Author_Institution
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
fYear
2000
fDate
2000
Firstpage
525
Lastpage
534
Abstract
Analysis of range queries on spatial (multidimensional) data is both important and challenging. Most previous analysis attempts have made certain simplifying assumptions about the data sets and/or queries to keep the analysis tractable. As a result, they may no be universally applicable. This paper proposes a set of five analysis techniques to estimate the selectivity and number of index nodes accessed in serving a range query. The underlying philosophy behind these techniques is to maintain an auxiliary data structure called a density file, whose creation is a one-time cost, which can be quickly consulted when the query is given. The schemes differ in what information is kept in the density file, how it is maintained and how this information is looked up. It is shown that one of the proposed schemes, called “cumulative density” (CD), gives very accurate results (usually less then 5% error) using a diverse suite of point and rectangular data sets, that are uniform or skewed, and a wide range of query window parameters. The estimation takes a constant amount of time, which is typically lower than 1% of the time that it would take to execute the query, regardless of data set or query window parameters
Keywords
database theory; query processing; spatial data structures; visual databases; auxiliary data structure; cumulative density; data lookup method; density file; index node access; information maintenance; multidimensional data; one-time cost; point data sets; query window parameters; range query analysis; rectangular data sets; selectivity estimation; skewed data sets; spatial data; uniform data sets; Application software; Computer science; Costs; Electronic switching systems; Geographic Information Systems; Image databases; Information retrieval; Multidimensional systems; Performance analysis; Spatial databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2000. Proceedings. 16th International Conference on
Conference_Location
San Diego, CA
ISSN
1063-6382
Print_ISBN
0-7695-0506-6
Type
conf
DOI
10.1109/ICDE.2000.839451
Filename
839451
Link To Document