• DocumentCode
    3280415
  • Title

    Data driven multi-index hashing

  • Author

    Ji Wan ; Sheng Tang ; Yongdong Zhang ; Lei Huang ; Jintao Li

  • Author_Institution
    Adv. Comput. Res. Lab., Inst. of Comput. Technol., Beijing, China
  • fYear
    2013
  • fDate
    15-18 Sept. 2013
  • Firstpage
    2670
  • Lastpage
    2673
  • Abstract
    Binary representation for large scale nearest neighbor search received more and more concern recently. Although binary codes can be directly used as indices of the hash tables, correlations between the bits may lead to non-uniform codes distribution and reduce the performance of the hash table. In this paper, we propose a data driven multi-index hashing method for exact nearest neighbor search in Hamming space. By exploring the statistics properties of the dataset, we can separate the correlated bits into different segments during the process of building multiple hash tables, and thus make binary codes distributed as uniformly as possible in each hash table. Experiments conducted on a huge amount of binary codes extracted from the UK Bench dataset show that our method can achieve significant acceleration in searching speed for large scale dataset.
  • Keywords
    image representation; search problems; statistical analysis; UK Bench dataset; binary codes; binary representation; data driven multiindex hashing; hash tables; large scale nearest neighbor search; nonuniform codes distribution; searching speed; statistics properties; Binary codes; Clustering algorithms; Indexing; Nearest neighbor search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing (ICIP), 2013 20th IEEE International Conference on
  • Conference_Location
    Melbourne, VIC
  • Type

    conf

  • DOI
    10.1109/ICIP.2013.6738550
  • Filename
    6738550