Abstract :
Hashing techniques have been intensively investigated in the design of highly efficient search engines for large-scale computer vision applications. Compared with prior approximate nearest neighbor search approaches like tree-based indexing, hashing-based search schemes have prominent advantages in terms of both storage and computational efficiencies. Moreover, the procedure of devising hash functions can be easily incorporated into sophisticated machine learning tools, leading to data-dependent and task-specific compact hash codes. Therefore, a number of learning paradigms, ranging from unsupervised to supervised, have been applied to compose appropriate hash functions. However, most of the existing hash function learning methods either treat hash function design as a classification problem or generate binary codes to satisfy pair wise supervision, and have not yet directly optimized the search accuracy. In this paper, we propose to leverage list wise supervision into a principled hash function learning framework. In particular, the ranking information is represented by a set of rank triplets that can be used to assess the quality of ranking. Simple linear projection-based hash functions are solved efficiently through maximizing the ranking quality over the training data. We carry out experiments on large image datasets with size up to one million and compare with the state-of-the-art hashing techniques. The extensive results corroborate that our learned hash codes via list wise supervision can provide superior search accuracy without incurring heavy computational overhead.
Keywords :
binary codes; computer vision; file organisation; image classification; image coding; search problems; unsupervised learning; approximate nearest neighbor search approaches; binary codes; data-dependent compact hash codes; hash code learning technique; hashing-based search schemes; image datasets; large-scale computer vision; linear projection-based hash functions; listwise supervision; principled hash function learning framework; rank triplets; ranking information; ranking quality; sophisticated machine learning tools; task-specific compact hash codes; tree-based indexing; Binary codes; Educational institutions; Loss measurement; Search problems; Semantics; Tensile stress; Vectors;