Title :
SPEC hashing: Similarity preserving algorithm for entropy-based coding
Author :
Lin, Ruei-Sung ; Ross, David A. ; Yagnik, Jay
Author_Institution :
Google Inc., Mountain View, CA, USA
Abstract :
Searching approximate nearest neighbors in large scale high dimensional data set has been a challenging problem. This paper presents a novel and fast algorithm for learning binary hash functions for fast nearest neighbor retrieval. The nearest neighbors are defined according to the semantic similarity between the objects. Our method uses the information of these semantic similarities and learns a hash function with binary code such that only objects with high similarity have small Hamming distance. The hash function is incrementally trained one bit at a time, and as bits are added to the hash code Hamming distances between dissimilar objects increase. We further link our method to the idea of maximizing conditional entropy among pair of bits and derive an extremely efficient linear time hash learning algorithm. Experiments on similar image retrieval and celebrity face recognition show that our method produces apparent improvement in performance over some state-of-the-art methods.
Keywords :
cryptography; entropy codes; face recognition; image coding; image retrieval; SPEC hashing; approximate nearest neighbors; entropy-based coding; face recognition; fast nearest neighbor retrieval; hash code Hamming distances; large scale high dimensional data set; linear time hash learning algorithm; similarity preserving algorithm; Binary codes; Boosting; Entropy; Face recognition; Hamming distance; Image retrieval; Large-scale systems; Music information retrieval; Nearest neighbor searches; Vectors;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
978-1-4244-6984-0
DOI :
10.1109/CVPR.2010.5540129