• DocumentCode
    104890
  • Title

    Fast Exact Search in Hamming Space With Multi-Index Hashing

  • Author

    Norouzi, Mohammad ; Punjani, Ali ; Fleet, David J.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Toronto, Toronto, ON, Canada
  • Volume
    36
  • Issue
    6
  • fYear
    2014
  • fDate
    Jun-14
  • Firstpage
    1107
  • Lastpage
    1119
  • Abstract
    There is growing interest in representing image data and feature descriptors using compact binary codes for fast near neighbor search. Although binary codes are motivated by their use as direct indices (addresses) into a hash table, codes longer than 32 bits are not being used as such, as it was thought to be ineffective. We introduce a rigorous way to build multiple hash tables on binary code substrings that enables exact k-nearest neighbor search in Hamming space. The approach is storage efficient and straight-forward to implement. Theoretical analysis shows that the algorithm exhibits sub-linear run-time behavior for uniformly distributed codes. Empirical results show dramatic speedups over a linear scan baseline for datasets of up to one billion codes of 64, 128, or 256 bits.
  • Keywords
    cryptography; feature extraction; image coding; image representation; search problems; Hamming space; compact binary codes; distributed codes; fast exact search; feature descriptors; hash table; image data representation; k-nearest neighbor; multiindex hashing; Algorithm design and analysis; Binary codes; Complexity theory; Databases; Hamming distance; Search problems; Upper bound; Binary codes; Hamming distance; large-scale image retrieval; multi index hashing; multi-index hashing; nearest neighbor search;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2013.231
  • Filename
    6671916