Title :
Searching with expectations
Author :
Sandhawalia, Harsimrat ; Jegou, Herve
Author_Institution :
INRIA, France
Abstract :
Handling large amounts of data, such as large image databases, requires the use of approximate nearest neighbor search techniques. Recently, Hamming embedding methods such as spectral hashing have addressed the problem of obtaining compact binary codes optimizing the trade-off between the memory usage and the probability of retrieving the true nearest neighbors. In this paper, we formulate the problem of generating compact signatures as a rate-distortion problem. In the spirit of source coding algorithms, we aim at minimizing the reconstruction error on the squared distances with a constraint on the memory usage. The vectors are ranked based on the distance estimates to the query vector. Experiments on image descriptors show a significant improvement over spectral hashing.
Keywords :
binary codes; embedded systems; learning (artificial intelligence); pattern classification; probability; query processing; rate distortion theory; search problems; source coding; very large databases; visual databases; Hamming embedding methods; approximate nearest neighbor search techniques; compact binary codes; compact signatures; image descriptors; large image databases; memory usage; probability; query vector; rate-distortion problem; reconstruction error; source coding algorithms; spectral hashing; squared distances; true nearest neighbors; Binary codes; Hamming distance; Image databases; Image reconstruction; Indexing; Memory management; Nearest neighbor searches; Neural networks; Rate-distortion; Source coding; nearest neighbor search; quantization; source coding;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495403