• DocumentCode
    80469
  • Title

    Visual Words Assignment Via Information-Theoretic Manifold Embedding

  • Author

    Yue Deng ; Yipeng Li ; Yanjun Qian ; Xiangyang Ji ; Qionghai Dai

  • Author_Institution
    Dept. of Autom., Tsinghua Univ., Beijing, China
  • Volume
    44
  • Issue
    10
  • fYear
    2014
  • fDate
    Oct. 2014
  • Firstpage
    1924
  • Lastpage
    1937
  • Abstract
    Codebook-based learning provides a flexible way to extract the contents of an image in a data-driven manner for visual recognition. One central task in such frameworks is codeword assignment, which allocates local image descriptors to the most similar codewords in the dictionary to generate histogram for categorization. Nevertheless, existing assignment approaches, e.g., nearest neighbors strategy (hard assignment) and Gaussian similarity (soft assignment), suffer from two problems: 1) too strong Euclidean assumption and 2) neglecting the label information of the local descriptors. To address the aforementioned two challenges, we propose a graph assignment method with maximal mutual information (GAMI) regularization. GAMI takes the power of manifold structure to better reveal the relationship of massive number of local features by nonlinear graph metric. Meanwhile, the mutual information of descriptor-label pairs is ultimately optimized in the embedding space for the sake of enhancing the discriminant property of the selected codewords. According to such objective, two optimization models, i.e., inexact-GAMI and exact-GAMI, are respectively proposed in this paper. The inexact model can be efficiently solved with a closed-from solution. The stricter exact-GAMI nonparametrically estimates the entropy of descriptor-label pairs in the embedding space and thus leads to a relatively complicated but still trackable optimization. The effectiveness of GAMI models are verified on both the public and our own datasets.
  • Keywords
    computer vision; feature extraction; graph theory; image recognition; learning (artificial intelligence); Euclidean assumption; GAMI regularization; Gaussian similarity; closed-from solution; codebook-based learning; codeword assignment; descriptor-label pairs; embedding space; exact-GAMI model; graph assignment method with maximal mutual information; image categorization; image contents extraction; image descriptors; inexact-GAMI model; information-theoretic manifold embedding; nearest neighbors strategy; nonlinear graph metric; visual recognition; visual words assignment; Entropy; Histograms; Manifolds; Measurement; Mutual information; Optimization; Visualization; Manifold embedding; mutual information; scene categorization; visual words assignment;
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2267
  • Type

    jour

  • DOI
    10.1109/TCYB.2014.2300192
  • Filename
    6727495