Semantic Sparse Recoding of Visual Content for Image Applications

Author

Zhiwu Lu ; Peng Han ; Liwei Wang ; Ji-Rong Wen

Author_Institution

Sch. of Inf., Renmin Univ. of China, Beijing, China

Volume

24

Issue

1

fYear

2015

fDate

Jan. 2015

Firstpage

176

Lastpage

188

Abstract

This paper presents a new semantic sparse recoding method to generate more descriptive and robust representation of visual content for image applications. Although the visual bag-of-words (BOW) representation has been reported to achieve promising results in different image applications, its visual codebook is completely learnt from low-level visual features using quantization techniques and thus the so-called semantic gap remains unbridgeable. To handle such challenging issue, we utilize the annotations (predicted by algorithms or shared by users) of all the images to improve the original visual BOW representation. This is further formulated as a sparse coding problem so that the noise issue induced by the inaccurate quantization of visual features can also be handled to some extent. By developing an efficient sparse coding algorithm, we successfully generate a new visual BOW representation for image applications. Since such sparse coding has actually incorporated the high-level semantic information into the original visual codebook, we thus consider it as semantic sparse recoding of the visual content. Finally, we apply our semantic sparse recoding method to automatic image annotation and social image classification. The experimental results on several benchmark datasets show the promising performance of our semantic sparse recoding method in these two image applications.

Keywords

image classification; image coding; image representation; quantisation (signal); annotation utilization; automatic image annotation; descriptive visual content generation; image applications; low-level visual features; quantization techniques; robust visual content representation generation; semantic gap; semantic sparse recoding method; social image classification; visual BOW representation; visual bag-of-words representation; visual codebook; Encoding; Image coding; Laplace equations; Optimization; Quantization (signal); Semantics; Visualization; Visual BOW representation; automatic image annotation; semantic gap; semantic sparse recoding; social image classification; sparse coding;

fLanguage

English

Journal_Title

Image Processing, IEEE Transactions on

Publisher

ieee

ISSN

1057-7149

Type

jour

DOI

10.1109/TIP.2014.2375641

Filename

6967791