DocumentCode
80469
Title
Visual Words Assignment Via Information-Theoretic Manifold Embedding
Author
Yue Deng ; Yipeng Li ; Yanjun Qian ; Xiangyang Ji ; Qionghai Dai
Author_Institution
Dept. of Autom., Tsinghua Univ., Beijing, China
Volume
44
Issue
10
fYear
2014
fDate
Oct. 2014
Firstpage
1924
Lastpage
1937
Abstract
Codebook-based learning provides a flexible way to extract the contents of an image in a data-driven manner for visual recognition. One central task in such frameworks is codeword assignment, which allocates local image descriptors to the most similar codewords in the dictionary to generate histogram for categorization. Nevertheless, existing assignment approaches, e.g., nearest neighbors strategy (hard assignment) and Gaussian similarity (soft assignment), suffer from two problems: 1) too strong Euclidean assumption and 2) neglecting the label information of the local descriptors. To address the aforementioned two challenges, we propose a graph assignment method with maximal mutual information (GAMI) regularization. GAMI takes the power of manifold structure to better reveal the relationship of massive number of local features by nonlinear graph metric. Meanwhile, the mutual information of descriptor-label pairs is ultimately optimized in the embedding space for the sake of enhancing the discriminant property of the selected codewords. According to such objective, two optimization models, i.e., inexact-GAMI and exact-GAMI, are respectively proposed in this paper. The inexact model can be efficiently solved with a closed-from solution. The stricter exact-GAMI nonparametrically estimates the entropy of descriptor-label pairs in the embedding space and thus leads to a relatively complicated but still trackable optimization. The effectiveness of GAMI models are verified on both the public and our own datasets.
Keywords
computer vision; feature extraction; graph theory; image recognition; learning (artificial intelligence); Euclidean assumption; GAMI regularization; Gaussian similarity; closed-from solution; codebook-based learning; codeword assignment; descriptor-label pairs; embedding space; exact-GAMI model; graph assignment method with maximal mutual information; image categorization; image contents extraction; image descriptors; inexact-GAMI model; information-theoretic manifold embedding; nearest neighbors strategy; nonlinear graph metric; visual recognition; visual words assignment; Entropy; Histograms; Manifolds; Measurement; Mutual information; Optimization; Visualization; Manifold embedding; mutual information; scene categorization; visual words assignment;
fLanguage
English
Journal_Title
Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
2168-2267
Type
jour
DOI
10.1109/TCYB.2014.2300192
Filename
6727495
Link To Document