• DocumentCode
    78895
  • Title

    Hashing on Nonlinear Manifolds

  • Author

    Fumin Shen ; Chunhua Shen ; Qinfeng Shi ; van den Hengel, Anton ; Zhenmin Tang ; Heng Tao Shen

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
  • Volume
    24
  • Issue
    6
  • fYear
    2015
  • fDate
    Jun-15
  • Firstpage
    1839
  • Lastpage
    1851
  • Abstract
    Learning-based hashing methods have attracted considerable attention due to their ability to greatly increase the scale at which existing algorithms may operate. Most of these methods are designed to generate binary codes preserving the Euclidean similarity in the original space. Manifold learning techniques, in contrast, are better able to model the intrinsic structure embedded in the original high-dimensional data. The complexities of these models, and the problems with out-of-sample data, have previously rendered them unsuitable for application to large-scale embedding, however. In this paper, how to learn compact binary embeddings on their intrinsic manifolds is considered. In order to address the above-mentioned difficulties, an efficient, inductive solution to the out-of-sample data problem, and a process by which nonparametric manifold learning may be used as the basis of a hashing method are proposed. The proposed approach thus allows the development of a range of new hashing techniques exploiting the flexibility of the wide variety of manifold learning approaches available. It is particularly shown that hashing on the basis of t-distributed stochastic neighbor embedding outperforms state-of-the-art hashing methods on large-scale benchmark data sets, and is very effective for image classification with very short code lengths. It is shown that the proposed framework can be further improved, for example, by minimizing the quantization error with learned orthogonal rotations without much computation overhead. In addition, a supervised inductive manifold hashing framework is developed by incorporating the label information, which is shown to greatly advance the semantic retrieval performance.
  • Keywords
    image coding; image retrieval; learning (artificial intelligence); stochastic processes; Euclidean similarity; binary codes; binary embeddings; large-scale embedding; learning-based hashing method; nonlinear manifolds; nonparametric manifold learning; orthogonal rotation; quantization error; semantic retrieval; supervised inductive manifold hashing; t-distributed stochastic neighbor embedding; Binary codes; Educational institutions; Eigenvalues and eigenfunctions; Learning systems; Manifolds; Prototypes; Training; Hashing; binary code learning; image retrieval; manifold learning;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2015.2405340
  • Filename
    7047876