• DocumentCode
    17410
  • Title

    Weakly Supervised Visual Dictionary Learning by Harnessing Image Attributes

  • Author

    Yue Gao ; Rongrong Ji ; Wei Liu ; Qionghai Dai ; Gang Hua

  • Author_Institution
    Dept. of AutomationTsinghua, Tsinghua Univ., Beijing, China
  • Volume
    23
  • Issue
    12
  • fYear
    2014
  • fDate
    Dec. 2014
  • Firstpage
    5400
  • Lastpage
    5411
  • Abstract
    Bag-of-features (BoFs) representation has been extensively applied to deal with various computer vision applications. To extract discriminative and descriptive BoF, one important step is to learn a good dictionary to minimize the quantization loss between local features and codewords. While most existing visual dictionary learning approaches are engaged with unsupervised feature quantization, the latest trend has turned to supervised learning by harnessing the semantic labels of images or regions. However, such labels are typically too expensive to acquire, which restricts the scalability of supervised dictionary learning approaches. In this paper, we propose to leverage image attributes to weakly supervise the dictionary learning procedure without requiring any actual labels. As a key contribution, our approach establishes a generative hidden Markov random field (HMRF), which models the quantized codewords as the observed states and the image attributes as the hidden states, respectively. Dictionary learning is then performed by supervised grouping the observed states, where the supervised information is stemmed from the hidden states of the HMRF. In such a way, the proposed dictionary learning approach incorporates the image attributes to learn a semantic-preserving BoF representation without any genuine supervision. Experiments in large-scale image retrieval and classification tasks corroborate that our approach significantly outperforms the state-of-the-art unsupervised dictionary learning approaches.
  • Keywords
    dictionaries; hidden Markov models; image classification; image coding; image representation; image retrieval; learning (artificial intelligence); random processes; HMRF; bag-of-feature representation; codeword quantization; computer vision application; descriptive BoF extraction; discriminative BoF extraction; generative hidden Markov random field; image attribute harnessing; image classification; image retrieval; image semantic label harnessing; quantization loss; semantic-preserving BoF representation; unsupervised dictionary learning approach; unsupervised feature quantization; weakly supervised visual dictionary learning approach; Detectors; Dictionaries; Feature extraction; Hidden Markov models; Quantization (signal); Semantics; Visualization; Bag-of-Features; Bag-of-features; Hidden Markov Random Field; hidden Markov random field; image attribute; image classification; image search; visual dictionary; weakly supervised learning;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2014.2364536
  • Filename
    6939679