DocumentCode :
270304
Title :
Beyond “project and sign” for cosine estimation with binary codes
Author :
Balu, Radhakrishnan ; Furon, Teddy ; Jégou, Hervé
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
68884
Lastpage :
6888
Abstract :
Many nearest neighbor search algorithms rely on encoding real vectors into binary vectors. The most common strategy projects the vectors onto random directions and takes the sign to produce so-called sketches. This paper discusses the sub-optimality of this choice, and proposes a better encoding strategy based on the quantization and reconstruction points of view. Our second contribution is a novel asymmetric estimator for the cosine similarity. Similar to previous asymmetric schemes, the query is not quantized and the similarity is computed in the compressed domain. Both our contribution leads to improve the quality of nearest neighbor search with binary codes. Its efficiency compares favorably against a recent encoding technique.
Keywords :
binary codes; search problems; asymmetric estimator; binary codes; binary vectors; cosine estimation; cosine similarity; encoding strategy; nearest neighbor search algorithms; project and sign; random directions; Binary codes; Databases; Encoding; Estimation; Hamming distance; Quantization (signal); Vectors; Hamming embedding; Locality sensitive hashing; approximate nearest neighbors; similarity search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854934
Filename :
6854934
Link To Document :
بازگشت