DocumentCode :
3267865
Title :
LDC: enabling search by partial distance in a hyper-dimensional space
Author :
Koudas, Nick ; Ooi, Beng Chin ; Shen, Heng Tao ; Tung, Anthony K H
Author_Institution :
Shannon Lab., AT&T Labs Res., Basking Ridge, NJ, USA
fYear :
2004
fDate :
30 March-2 April 2004
Firstpage :
6
Lastpage :
17
Abstract :
Recent advances in research fields like multimedia and bioinformatics have brought about a new generation of hyper-dimensional databases which can contain hundreds or even thousands of dimensions. Such hyper-dimensional databases pose significant problems to existing high-dimensional indexing techniques which have been developed for indexing databases with (commonly) less than a hundred dimensions. To support efficient querying and retrieval on hyper-dimensional databases, we propose a methodology called local digital coding (LDC) which can support k-nearest neighbors (KNN) queries on hyper-dimensional databases and yet co-exist with ubiquitous indices, such as B+-trees. LDC extracts a simple bitmap representation called digital code(DC) for each point in the database. Pruning during KNN search is performed by dynamically selecting only a subset of the bits from the DC based on which subsequent comparisons are performed. In doing so, expensive operations involved in computing L-norm distance functions between hyper-dimensional data can be avoided. Extensive experiments are conducted to show that our methodology offers significant performance advantages over other existing indexing methods on both real life and synthetic hyper-dimensional datasets.
Keywords :
indexing; multimedia databases; query processing; tree data structures; tree searching; B+-tree; bioinformatics; bitmap representation; hyper-dimensional database; indexing database; indexing technique; local digital coding; multimedia; Audio databases; Bioinformatics; Computer science; Data mining; Extraterrestrial phenomena; Image databases; Indexing; Information retrieval; Laboratories; Multimedia databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2004. Proceedings. 20th International Conference on
ISSN :
1063-6382
Print_ISBN :
0-7695-2065-0
Type :
conf
DOI :
10.1109/ICDE.2004.1319980
Filename :
1319980
Link To Document :
بازگشت