Title :
Speeding Up Permutation Based Indexing with Indexing
Author :
Figueroa, Karina ; Frediksson, K.
Author_Institution :
Fac. de Cienc. Fisico-Mat., Univ. Michoacana, Morelia, Mexico
Abstract :
A recent probabilistic approach for searching in high dimensional metric spaces is based on predicting the distances between database elements according to how they order their distances towards some set of distinguished elements, called permutants. In the preprocessing phase a set of permutants is chosen, and are sorted (permuted) by their distances against every database element. The permutations form the index. When a query is given, its corresponding permutation is computed, and --- as similar elements will (probably) have a similar permutation --- the database is compared in the order induced by the similarity between permutations. This works well but has relatively high CPU time due to computing the distances between permutations and (partially) sorting the database by the similarity. We improve this by identifying and solving this as another metric space problem. This avoids many distance computations between the permutants. The experimental results show that this works extremely well in practice.
Keywords :
database indexing; probability; query processing; search problems; sorting; CPU time; database indexing; database sorting; dimensional metric space search problem; preprocessing phase; probabilistic approach; query processing; speeding-up permutation indexing; Application software; Audio databases; Biology computing; Computer science; Extraterrestrial measurements; Image coding; Image databases; Indexing; Quantization; Sorting; indexing permutations; metric space indexing; probabilistic algorithms;
Conference_Titel :
Similarity Search and Applications, 2009. SISAP '09. Second International Workshop on
Conference_Location :
Prague
Print_ISBN :
978-0-7695-3765-8
DOI :
10.1109/SISAP.2009.12