Hashing on Nonlinear Manifolds

Author

Fumin Shen ; Chunhua Shen ; Qinfeng Shi ; van den Hengel, Anton ; Zhenmin Tang ; Heng Tao Shen

Author_Institution

Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China

Volume

24

Issue

6

fYear

2015

fDate

Jun-15

Firstpage

1839

Lastpage

1851

Abstract

Learning-based hashing methods have attracted considerable attention due to their ability to greatly increase the scale at which existing algorithms may operate. Most of these methods are designed to generate binary codes preserving the Euclidean similarity in the original space. Manifold learning techniques, in contrast, are better able to model the intrinsic structure embedded in the original high-dimensional data. The complexities of these models, and the problems with out-of-sample data, have previously rendered them unsuitable for application to large-scale embedding, however. In this paper, how to learn compact binary embeddings on their intrinsic manifolds is considered. In order to address the above-mentioned difficulties, an efficient, inductive solution to the out-of-sample data problem, and a process by which nonparametric manifold learning may be used as the basis of a hashing method are proposed. The proposed approach thus allows the development of a range of new hashing techniques exploiting the flexibility of the wide variety of manifold learning approaches available. It is particularly shown that hashing on the basis of t-distributed stochastic neighbor embedding outperforms state-of-the-art hashing methods on large-scale benchmark data sets, and is very effective for image classification with very short code lengths. It is shown that the proposed framework can be further improved, for example, by minimizing the quantization error with learned orthogonal rotations without much computation overhead. In addition, a supervised inductive manifold hashing framework is developed by incorporating the label information, which is shown to greatly advance the semantic retrieval performance.

Keywords

image coding; image retrieval; learning (artificial intelligence); stochastic processes; Euclidean similarity; binary codes; binary embeddings; large-scale embedding; learning-based hashing method; nonlinear manifolds; nonparametric manifold learning; orthogonal rotation; quantization error; semantic retrieval; supervised inductive manifold hashing; t-distributed stochastic neighbor embedding; Binary codes; Educational institutions; Eigenvalues and eigenfunctions; Learning systems; Manifolds; Prototypes; Training; Hashing; binary code learning; image retrieval; manifold learning;

fLanguage

English

Journal_Title

Image Processing, IEEE Transactions on

Publisher

ieee

ISSN

1057-7149

Type

jour

DOI

10.1109/TIP.2015.2405340

Filename

7047876