Title :
Cost-based unbalanced R-trees
Author :
Ross, Kenneth A. ; Sitzmann, Inga ; Stuckey, Peter J.
Author_Institution :
Dept. of Comput. Sci., Columbia Univ., New York, NY, USA
Abstract :
Cost-based unbalanced R-trees (CUR-trees) are a cost-function-based data structure for spatial data. CUR-trees are constructed specifically to improve the evaluation of intersection queries, the most basic selection query in an R-tree. A CUR-tree is built taking into account a given query distribution for the queries and a cost model for their execution. Depending on the expected frequency of access, objects or subtrees are stored higher up in the tree. After each insertion in the tree, local reorganizations of a node and its children have their expected query cost evaluated, and a reorganization is performed if this is beneficial. No strict balancing of the trees applies, allowing the tree to unfold solely based on the result of the cost evaluation. We present our cost-based approach and describe the evaluation and reorganization operations based on the cost function. We present a cost model for in-memory access costs and we present three different query models. In our experiments, we compare the performance of the CUR-tree to the R-tree and the R*-tree. The CUR-tree is able to significantly improve intersection query performance, without unacceptably increasing the cost of building the tree. The use of R-trees for in-memory data reflects the high (and growing) cost of bringing data from RAM into the CPU cache relative to the cost of other computations
Keywords :
cache storage; costing; query processing; software performance evaluation; tree data structures; CPU cache; CUR-trees; R*-tree; RAM; access frequency; computational cost; cost function-based data structure; cost-based unbalanced R-trees; in-memory access costs; intersection query evaluation; intersection query performance; local node reorganizations; query cost evaluation; query distribution; query execution cost model; query models; selection query; spatial data; subtrees; tree unfolding; Computational efficiency; Computer applications; Computer science; Cost function; Data structures; Design automation; Frequency; Performance evaluation; Read-write memory; Spatial indexes;
Conference_Titel :
Scientific and Statistical Database Management, 2001. SSDBM 2001. Proceedings. Thirteenth International Conference on
Conference_Location :
Fairfax, VA
Print_ISBN :
0-7695-1218-6
DOI :
10.1109/SSDM.2001.938552