• DocumentCode
    59947
  • Title

    Semantic Sparse Recoding of Visual Content for Image Applications

  • Author

    Zhiwu Lu ; Peng Han ; Liwei Wang ; Ji-Rong Wen

  • Author_Institution
    Sch. of Inf., Renmin Univ. of China, Beijing, China
  • Volume
    24
  • Issue
    1
  • fYear
    2015
  • fDate
    Jan. 2015
  • Firstpage
    176
  • Lastpage
    188
  • Abstract
    This paper presents a new semantic sparse recoding method to generate more descriptive and robust representation of visual content for image applications. Although the visual bag-of-words (BOW) representation has been reported to achieve promising results in different image applications, its visual codebook is completely learnt from low-level visual features using quantization techniques and thus the so-called semantic gap remains unbridgeable. To handle such challenging issue, we utilize the annotations (predicted by algorithms or shared by users) of all the images to improve the original visual BOW representation. This is further formulated as a sparse coding problem so that the noise issue induced by the inaccurate quantization of visual features can also be handled to some extent. By developing an efficient sparse coding algorithm, we successfully generate a new visual BOW representation for image applications. Since such sparse coding has actually incorporated the high-level semantic information into the original visual codebook, we thus consider it as semantic sparse recoding of the visual content. Finally, we apply our semantic sparse recoding method to automatic image annotation and social image classification. The experimental results on several benchmark datasets show the promising performance of our semantic sparse recoding method in these two image applications.
  • Keywords
    image classification; image coding; image representation; quantisation (signal); annotation utilization; automatic image annotation; descriptive visual content generation; image applications; low-level visual features; quantization techniques; robust visual content representation generation; semantic gap; semantic sparse recoding method; social image classification; visual BOW representation; visual bag-of-words representation; visual codebook; Encoding; Image coding; Laplace equations; Optimization; Quantization (signal); Semantics; Visualization; Visual BOW representation; automatic image annotation; semantic gap; semantic sparse recoding; social image classification; sparse coding;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2014.2375641
  • Filename
    6967791