DocumentCode :
1437853
Title :
Optimal Hash Functions for Approximate Matches on the n -Cube
Author :
Gordon, Daniel M. ; Miller, Victor S. ; Ostapenko, Peter
Author_Institution :
IDA Center for Commun. Res., San Diego, CA, USA
Volume :
56
Issue :
3
fYear :
2010
fDate :
3/1/2010 12:00:00 AM
Firstpage :
984
Lastpage :
991
Abstract :
One way to find near-matches in large datasets is to use hash functions. In recent years locality-sensitive hash functions for various metrics have been given; for the Hamming metric projecting onto k bits is simple hash function that performs well. In this paper, we investigate alternatives to projection. For various parameters hash functions given by complete decoding algorithms for error-correcting codes work better, and asymptotically random codes perform better than projection.
Keywords :
approximation theory; cryptography; decoding; error correction codes; random codes; Hamming metric; decoding algorithms; error-correcting codes; optimal hash functions; random codes; Decoding; Error correction codes; Hamming distance; Information retrieval; Linear code; Nearest neighbor searches; Sequences; Vectors; Approximate nearest neighbor; distance-sum optimal sets; error exponent; locality-sensitive hashing; random codes;
fLanguage :
English
Journal_Title :
Information Theory, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9448
Type :
jour
DOI :
10.1109/TIT.2009.2039037
Filename :
5429136
Link To Document :
بازگشت