DocumentCode :
17410
Title :
Weakly Supervised Visual Dictionary Learning by Harnessing Image Attributes
Author :
Yue Gao ; Rongrong Ji ; Wei Liu ; Qionghai Dai ; Gang Hua
Author_Institution :
Dept. of AutomationTsinghua, Tsinghua Univ., Beijing, China
Volume :
23
Issue :
12
fYear :
2014
fDate :
Dec. 2014
Firstpage :
5400
Lastpage :
5411
Abstract :
Bag-of-features (BoFs) representation has been extensively applied to deal with various computer vision applications. To extract discriminative and descriptive BoF, one important step is to learn a good dictionary to minimize the quantization loss between local features and codewords. While most existing visual dictionary learning approaches are engaged with unsupervised feature quantization, the latest trend has turned to supervised learning by harnessing the semantic labels of images or regions. However, such labels are typically too expensive to acquire, which restricts the scalability of supervised dictionary learning approaches. In this paper, we propose to leverage image attributes to weakly supervise the dictionary learning procedure without requiring any actual labels. As a key contribution, our approach establishes a generative hidden Markov random field (HMRF), which models the quantized codewords as the observed states and the image attributes as the hidden states, respectively. Dictionary learning is then performed by supervised grouping the observed states, where the supervised information is stemmed from the hidden states of the HMRF. In such a way, the proposed dictionary learning approach incorporates the image attributes to learn a semantic-preserving BoF representation without any genuine supervision. Experiments in large-scale image retrieval and classification tasks corroborate that our approach significantly outperforms the state-of-the-art unsupervised dictionary learning approaches.
Keywords :
dictionaries; hidden Markov models; image classification; image coding; image representation; image retrieval; learning (artificial intelligence); random processes; HMRF; bag-of-feature representation; codeword quantization; computer vision application; descriptive BoF extraction; discriminative BoF extraction; generative hidden Markov random field; image attribute harnessing; image classification; image retrieval; image semantic label harnessing; quantization loss; semantic-preserving BoF representation; unsupervised dictionary learning approach; unsupervised feature quantization; weakly supervised visual dictionary learning approach; Detectors; Dictionaries; Feature extraction; Hidden Markov models; Quantization (signal); Semantics; Visualization; Bag-of-Features; Bag-of-features; Hidden Markov Random Field; hidden Markov random field; image attribute; image classification; image search; visual dictionary; weakly supervised learning;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2014.2364536
Filename :
6939679
Link To Document :
بازگشت