DocumentCode :
2719558
Title :
Weakly supervised sparse coding with geometric consistency pooling
Author :
Cao, Liujuan ; Ji, Rongrong ; Gao, Yue ; Yang, Yi ; Tian, Qi
fYear :
2012
fDate :
16-21 June 2012
Firstpage :
3578
Lastpage :
3585
Abstract :
Most recently the Bag-of-Features (BoF) representation has been well advocated for image search and classification, with two decent phases named sparse coding and max pooling to compensate quantization loss as well as inject spatial layouts. But still, much information has been discarded by quantizing local descriptors with two-dimensional layouts into a one-dimensional BoF histogram. In this paper, we revisit this popular “sparse coding + max pooling” paradigm by “looking around” the local descriptor context towards an optimal BoF. First, we introduce a Weakly supervised Sparse Coding (WSC) to exploit the Classemes-based attribute labeling to refine the descriptor coding procedure. It is achieved by learning an attribute-to-word co-occurrence prior to impose a label inconsistency distortion over the ℓ1 based coding regularizer, such that the descriptor codes can maximally preserve the image semantic similarity. Second, we propose an adaptive feature pooling scheme over “superpixels” rather than over fixed spatial pyramids, named Geometric Consistency Pooling (GCP). As an effect, local descriptors enjoying good geometric consistency are pooled together to ensure a more precise spatial layouts embedding in BoF. Both of our phases are unsupervised, which differ from the existing works in supervised dictionary learning, sparse coding and feature pooling. Therefore, our approach enables potential applications like scalable visual search. We evaluate in both image classification and search benchmarks and report good improvements over the state-of-the-arts.
Keywords :
image classification; image coding; learning (artificial intelligence); search problems; ℓ1 based coding regularizer; BoF representation; Classemes-based attribute labeling; GCP; WSC; adaptive feature pooling scheme; attribute-to-word cooccurrence; bag-of-features; descriptor coding procedure; fixed spatial pyramid; geometric consistency pooling; image classification; image search; image semantic similarity; max pooling; one-dimensional BoF histogram; quantization loss; scalable visual search; spatial layouts embedding; supervised dictionary learning; weakly supervised sparse coding; Dictionaries; Encoding; Equations; Image coding; Image segmentation; Layout; Semantics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location :
Providence, RI
ISSN :
1063-6919
Print_ISBN :
978-1-4673-1226-4
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2012.6248102
Filename :
6248102
Link To Document :
بازگشت