• DocumentCode
    248497
  • Title

    Component hashing of variable-length binary aggregated descriptors for fast image search

  • Author

    Zhe Wang ; Ling-Yu Duan ; Jie Lin ; Tiejun Huang ; Wen Gao ; Bober, Miroslaw

  • Author_Institution
    Sch. of EE&CS, Peking Univ., Beijing, China
  • fYear
    2014
  • fDate
    27-30 Oct. 2014
  • Firstpage
    2217
  • Lastpage
    2221
  • Abstract
    Compact locally aggregated binary features have shown great advantages in image search. As the exhaustive linear search in Hamming space still entails too much computational complexity for large datasets, recent works proposed to directly use binary codes as hash indices, yielding a dramatic increase in speedup. However, these methods cannot be directly applied to variable-length binary features. In this paper, we propose a Component Hashing (CoHash) algorithm to handle the variable-length binary aggregated descriptors indexing for fast image search. The main idea is to decompose the distance measure between variable-length descriptors into aligned component-to-component matching problems independently, and build multiple hash tables for the visual word components. Given a query, its candidate neighbors are found by using the query binary sub-vectors as indices into their corresponding hash tables. In particular, a bit selection based on conditional mutual information maximization is proposed to reduce the dimensionality of visual word components, which provides a light storage of indices and balances the retrieval accuracy and search cost. Extensive experiments on benchmark datasets show that our approach is 20~25 times faster than linear search, without any noticeable retrieval performance loss.
  • Keywords
    computational complexity; file organisation; image retrieval; indexing; CoHash; Hamming space; binary codes; compact locally aggregated binary features; component hashing algorithm; computational complexity; conditional mutual information maximization; exhaustive linear search; fast image search; hash indices; multiple hash tables; query binary sub-vectors; retrieval performance loss; variable-length binary aggregated descriptor indexing; variable-length binary features; visual word components; Accuracy; Binary codes; Databases; Hamming distance; Image coding; Vectors; Visualization; Aggregated Descriptors; Component Hashing; Image Search; Variable-length Binary Codes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing (ICIP), 2014 IEEE International Conference on
  • Conference_Location
    Paris
  • Type

    conf

  • DOI
    10.1109/ICIP.2014.7025449
  • Filename
    7025449