Title :
Beyond “project and sign” for cosine estimation with binary codes
Author :
Balu, Radhakrishnan ; Furon, Teddy ; Jégou, Hervé
Abstract :
Many nearest neighbor search algorithms rely on encoding real vectors into binary vectors. The most common strategy projects the vectors onto random directions and takes the sign to produce so-called sketches. This paper discusses the sub-optimality of this choice and proposes a better encoding strategy based on the quantization and reconstruction points of view. Our second contribution is a novel asymmetric estimator for the cosine similarity. As in previous asymmetric schemes, the query is not quantized and the similarity is computed in the compressed domain. Both contributions improve the quality of nearest neighbor search with binary codes. Their efficiency compares favorably with that of a recent encoding technique.
Keywords :
binary codes; search problems; asymmetric estimator; binary vectors; cosine estimation; cosine similarity; encoding strategy; nearest neighbor search algorithms; project and sign; random directions; Databases; Encoding; Estimation; Hamming distance; Quantization (signal); Vectors; Hamming embedding; Locality sensitive hashing; approximate nearest neighbors; similarity search
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854934