Regional Information Center for Science and Technology - Weakly Supervised Visual Dictionary Learning by Harnessing Image Attributes

DocumentCode :

17410

Title :

Weakly Supervised Visual Dictionary Learning by Harnessing Image Attributes

Author :

Yue Gao ; Rongrong Ji ; Wei Liu ; Qionghai Dai ; Gang Hua

Author_Institution :

Dept. of Automation, Tsinghua Univ., Beijing, China

Volume :

Issue :

fYear :

2014

fDate :

Dec. 2014

Firstpage :

5400

Lastpage :

5411

Abstract :

Bag-of-features (BoF) representation has been extensively applied to various computer vision applications. To extract a discriminative and descriptive BoF, one important step is to learn a good dictionary that minimizes the quantization loss between local features and codewords. While most existing visual dictionary learning approaches rely on unsupervised feature quantization, the latest trend has turned to supervised learning by harnessing the semantic labels of images or regions. However, such labels are typically too expensive to acquire, which restricts the scalability of supervised dictionary learning approaches. In this paper, we propose to leverage image attributes to weakly supervise the dictionary learning procedure without requiring any actual labels. As a key contribution, our approach establishes a generative hidden Markov random field (HMRF), which models the quantized codewords as the observed states and the image attributes as the hidden states. Dictionary learning is then performed by supervised grouping of the observed states, where the supervisory information stems from the hidden states of the HMRF. In this way, the proposed dictionary learning approach incorporates the image attributes to learn a semantic-preserving BoF representation without any genuine supervision. Experiments in large-scale image retrieval and classification tasks corroborate that our approach significantly outperforms the state-of-the-art unsupervised dictionary learning approaches.

Keywords :

dictionaries; hidden Markov models; image classification; image coding; image representation; image retrieval; learning (artificial intelligence); random processes; HMRF; bag-of-feature representation; codeword quantization; computer vision application; descriptive BoF extraction; discriminative BoF extraction; generative hidden Markov random field; image attribute harnessing; image semantic label harnessing; quantization loss; semantic-preserving BoF representation; unsupervised dictionary learning approach; unsupervised feature quantization; weakly supervised visual dictionary learning approach; Detectors; Feature extraction; Quantization (signal); Semantics; Visualization; Bag-of-Features; Hidden Markov Random Field; image attribute; image search; visual dictionary; weakly supervised learning

fLanguage :

English

Journal_Title :

Image Processing, IEEE Transactions on

Publisher :

IEEE

ISSN :

1057-7149

Type :

jour

DOI :

10.1109/TIP.2014.2364536

Filename :

6939679

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=17410