• DocumentCode
    3601455
  • Title

    Spherical Hashing: Binary Code Embedding with Hyperspheres

  • Author

    Jae-Pil Heo ; Youngwoon Lee ; Junfeng He ; Shih-Fu Chang ; Sung-Eui Yoon

  • Author_Institution
    Dept. of Comput. Sci., Korea Adv. Inst. of Sci. & Technol., Daejeon, South Korea
  • Volume
    37
  • Issue
    11
  • fYear
    2015
  • Firstpage
    2304
  • Lastpage
    2316
  • Abstract
    Many binary code embedding schemes have been actively studied recently, since they can provide efficient similarity search, and compact data representations suitable for handling large scale image databases. Existing binary code embedding techniques encode high-dimensional data by using hyperplane-based hashing functions. In this paper we propose a novel hypersphere-based hashing function, spherical hashing, to map more spatially coherent data points into a binary code compared to hyperplane-based hashing functions. We also propose a new binary code distance function, spherical Hamming distance, tailored for our hypersphere-based binary coding scheme, and design an efficient iterative optimization process to achieve both balanced partitioning for each hash function and independence between hashing functions. Furthermore, we generalize spherical hashing to support various similarity measures defined by kernel functions. Our extensive experiments show that our spherical hashing technique significantly outperforms state-of-the-art techniques based on hyperplanes across various benchmarks with sizes ranging from one to 75 million of GIST, BoW and VLAD descriptors. The performance gains are consistent and large, up to 100 percent improvements over the second best method among tested methods. These results confirm the unique merits of using hyperspheres to encode proximity regions in high-dimensional spaces. Finally, our method is intuitive and easy to implement.
  • Keywords
    Hamming codes; binary codes; data structures; file organisation; iterative methods; optimisation; visual databases; BoW descriptor; GIST descriptor; VLAD descriptor; balanced partitioning; binary code distance function; binary code embedding technique; compact data representation; high-dimensional data encoding; high-dimensional space; hyperplane-based hashing function; hyperplanes; hypersphere-based hashing function; iterative optimization process; kernel function; large scale image database handling; proximity region encoding; similarity measure; similarity search; spatially coherent data points; spherical Hamming distance; spherical hashing technique; Binary codes; Hamming distance; Image databases; Kernel; Measurement; Optimization; Quantization (signal); Hashing; Large-scale image search; binary codes; hashing; large-scale image search;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2015.2408363
  • Filename
    7051256