Title :
Counting distance permutations
Author_Institution :
David R. Cheriton Sch. of Comput. Sci., Univ. of Waterloo, Waterloo, ON
Abstract :
A distance permutation index supports fast proximity searching in a high-dimensional metric space. Given some fixed reference sites, for each point in a database the index stores a permutation naming the closest site, the second-closest, and so on. We examine how many distinct permutations can occur as a function of the number of sites and the size of the space. We give theoretical results for tree metrics and vector spaces with L1, L2, and Linfin metrics, improving on the previous best known storage space in the vector case. We also give experimental results and commentary on the number of distance permutations that actually occur in a variety of vector, string, and document spaces.
Keywords :
database indexing; query processing; trees (mathematics); counting distance permutation index; document spaces; fixed reference sites; high-dimensional metric space; index stores; vector spaces; Audio databases; Computer science; Costs; Data structures; Distance measurement; Extraterrestrial measurements; Genetics; Image databases; Image recognition; Indexes;
Conference_Titel :
Data Engineering Workshop, 2008. ICDEW 2008. IEEE 24th International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-2161-9
Electronic_ISBN :
978-1-4244-2162-6
DOI :
10.1109/ICDEW.2008.4498346