Title :
LSH banding for large-scale retrieval with memory and recall constraints
Author :
Covell, Michele ; Baluja, Shumeet
Author_Institution :
Google Res., Google Inc., Mountain View, CA
Abstract :
Locality Sensitive Hashing (LSH) is widely used for efficient retrieval of candidate matches in very large audio, video, and image systems. However, extremely large reference databases necessitate a guaranteed limit on the memory used by the table lookup itself, no matter how the entries crowd different parts of the signature space, a guarantee that LSH does not give. In this paper, we provide such guaranteed limits, primarily through the design of the LSH bands. When combined with data-adaptive bin splitting (needed on only 0.04% of the occupied bins) this approach provides the required guarantee on memory usage. At the same time, it avoids the reduced recall that more extensive use of bin splitting would give.
Keywords :
file organisation; information retrieval; multimedia databases; candidate matches retrieval; data-adaptive bin splitting; extremely large reference databases; information retrieval; large-scale retrieval; locality sensitive hashing; memory constraints; recall constraints; table lookup; Fingerprint recognition; Image databases; Image retrieval; Information retrieval; Large-scale systems; Memory management; Multimedia databases; Mutual information; Pattern matching; Table lookup; Fingerprint identification; Information retrieval; Multimedia databases; Pattern matching;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4959971