DocumentCode :
1763754
Title :
Extremely Low Bit-Rate Nearest Neighbor Search Using a Set Compression Tree
Author :
Arandjelovic, Relja ; Zisserman, Andrew
Author_Institution :
Dept. of Eng. Sci., Univ. of Oxford, Oxford, UK
Volume :
36
Issue :
12
fYear :
2014
fDate :
Dec. 1 2014
Firstpage :
2396
Lastpage :
2406
Abstract :
The goal of this work is a data structure to support approximate nearest neighbor search on very large scale sets of vector descriptors. The criteria we wish to optimize are: (i) that the memory footprint of the representation should be very small (so that it fits into main memory); and (ii) that the approximation of the original vectors should be accurate. We introduce a novel encoding method, named a Set Compression Tree (SCT), that satisfies these criteria. It is able to accurately compress 1 million descriptors using only a few bits per descriptor. The large compression rate is achieved by not compressing on a per-descriptor basis, but instead by compressing the set of descriptors jointly. We describe the encoding, decoding and use for nearest neighbor search, all of which are quite straightforward to implement. The method, tested on standard benchmarks (SIFT1M and 80 Million Tiny Images), achieves superior performance to a number of state-of-the-art approaches, including Product Quantization, Locality Sensitive Hashing, Spectral Hashing, and Iterative Quantization. For example, SCT has a lower error using 5 bits than any of the other approaches, even when they use 16 or more bits per descriptor. We also include a comparison of all the above methods on the standard benchmarks.
Keywords :
pattern classification; search problems; set theory; approximate nearest neighbor search; data structure; encoding method; iterative quantization; locality sensitive hashing; low bit-rate nearest neighbor search; product quantization; set compression tree; spectral hashing; standard benchmarks; vector descriptors; very large scale sets; Approximation methods; Benchmark testing; Encoding; Image coding; Iterative decoding; Nearest neighbor searches; Quantization (signal); Approximate search; image indexing; large-scale image search; quantization; very large databases;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2014.2339821
Filename :
6858078
Link To Document :
بازگشت