• DocumentCode
    1437853
  • Title

    Optimal Hash Functions for Approximate Matches on the n -Cube

  • Author

    Gordon, Daniel M. ; Miller, Victor S. ; Ostapenko, Peter

  • Author_Institution
    IDA Center for Commun. Res., San Diego, CA, USA
  • Volume
    56
  • Issue
    3
  • fYear
    2010
  • fDate
    3/1/2010 12:00:00 AM
  • Firstpage
    984
  • Lastpage
    991
  • Abstract
    One way to find near-matches in large datasets is to use hash functions. In recent years locality-sensitive hash functions for various metrics have been given; for the Hamming metric projecting onto k bits is simple hash function that performs well. In this paper, we investigate alternatives to projection. For various parameters hash functions given by complete decoding algorithms for error-correcting codes work better, and asymptotically random codes perform better than projection.
  • Keywords
    approximation theory; cryptography; decoding; error correction codes; random codes; Hamming metric; decoding algorithms; error-correcting codes; optimal hash functions; random codes; Decoding; Error correction codes; Hamming distance; Information retrieval; Linear code; Nearest neighbor searches; Sequences; Vectors; Approximate nearest neighbor; distance-sum optimal sets; error exponent; locality-sensitive hashing; random codes;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.2009.2039037
  • Filename
    5429136