Title :
Spatial approximation + sequential scan = efficient metric indexing
Author :
Cháve, Edgar ; Herrera, Norma ; Reyes, Nora
Author_Institution :
Escuela de Ciencias Fisico-Matematicas, Univ. Michoacana, Morelia, Mexico
Abstract :
Proximity queries in metric spaces, or metric proximity queries for short, have instances suffering from the so-called "curse of dimensionality" referred to the apparent impossibility of beating the brute force approach (an exhaustive search to satisfy a metric proximity query). In other words, the cursed spaces resists indexing. Current research tends to alleviate this particular situation, since easy spaces can be indexed neatly. A recent hypothesis establishes that it is just a fraction of the database which is resilient to indexing, and the proper approach would be to index each part separately. This is precisely the approach of this work. We use two different techniques for indexing complementary parts of a metric database. The dynamic spatial approximation tree (dsa-tree) is a recently proposed data structure for searching in metric spaces. It has been shown that it compares favorably against alternative data structures in spaces of high dimension or queries with low selectivity On the other hand, pivot-based algorithms are effective tools for proximity searching in any metric spaces, if sufficient storage space is available. In this work we combine the concepts of spatial approximation and pivot-based algorithms. The new data structure maintains the full features of a dsa-tree while using the available memory to improve the query time. We show experimentally that our data structure is competitive choice for searching in cursed metric spaces.
Keywords :
database indexing; query processing; search problems; spatial data structures; tree data structures; brute force approach; curse of dimensionality; cursed spaces; data structure; dsa-tree; dynamic spatial approximation tree; metric database; metric indexing; metric spaces; pivot-based algorithm; proximity queries; proximity searching; sequential scan; Artificial intelligence; Data structures; Extraterrestrial measurements; Fingerprint recognition; Indexes; Indexing; Information retrieval; Resists; Spatial databases; Tree data structures;
Conference_Titel :
Computer Science Society, 2004. SCCC 2004. 24th International Conference of the Chilean
Print_ISBN :
0-7695-2185-1
DOI :
10.1109/QEST.2004.20