DocumentCode :
3601455
Title :
Spherical Hashing: Binary Code Embedding with Hyperspheres
Author :
Jae-Pil Heo ; Youngwoon Lee ; Junfeng He ; Shih-Fu Chang ; Sung-Eui Yoon
Author_Institution :
Dept. of Comput. Sci., Korea Adv. Inst. of Sci. & Technol., Daejeon, South Korea
Volume :
37
Issue :
11
fYear :
2015
Firstpage :
2304
Lastpage :
2316
Abstract :
Many binary code embedding schemes have been actively studied recently, since they can provide efficient similarity search, and compact data representations suitable for handling large scale image databases. Existing binary code embedding techniques encode high-dimensional data by using hyperplane-based hashing functions. In this paper we propose a novel hypersphere-based hashing function, spherical hashing, to map more spatially coherent data points into a binary code compared to hyperplane-based hashing functions. We also propose a new binary code distance function, spherical Hamming distance, tailored for our hypersphere-based binary coding scheme, and design an efficient iterative optimization process to achieve both balanced partitioning for each hash function and independence between hashing functions. Furthermore, we generalize spherical hashing to support various similarity measures defined by kernel functions. Our extensive experiments show that our spherical hashing technique significantly outperforms state-of-the-art techniques based on hyperplanes across various benchmarks with sizes ranging from one to 75 million of GIST, BoW and VLAD descriptors. The performance gains are consistent and large, up to 100 percent improvements over the second best method among tested methods. These results confirm the unique merits of using hyperspheres to encode proximity regions in high-dimensional spaces. Finally, our method is intuitive and easy to implement.
Keywords :
Hamming codes; binary codes; data structures; file organisation; iterative methods; optimisation; visual databases; BoW descriptor; GIST descriptor; VLAD descriptor; balanced partitioning; binary code distance function; binary code embedding technique; compact data representation; high-dimensional data encoding; high-dimensional space; hyperplane-based hashing function; hyperplanes; hypersphere-based hashing function; iterative optimization process; kernel function; large scale image database handling; proximity region encoding; similarity measure; similarity search; spatially coherent data points; spherical Hamming distance; spherical hashing technique; Binary codes; Hamming distance; Image databases; Kernel; Measurement; Optimization; Quantization (signal); Hashing; Large-scale image search; binary codes; hashing; large-scale image search;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2015.2408363
Filename :
7051256
Link To Document :
بازگشت