DocumentCode :
59947
Title :
Semantic Sparse Recoding of Visual Content for Image Applications
Author :
Zhiwu Lu ; Peng Han ; Liwei Wang ; Ji-Rong Wen
Author_Institution :
Sch. of Inf., Renmin Univ. of China, Beijing, China
Volume :
24
Issue :
1
fYear :
2015
fDate :
Jan. 2015
Firstpage :
176
Lastpage :
188
Abstract :
This paper presents a new semantic sparse recoding method to generate more descriptive and robust representation of visual content for image applications. Although the visual bag-of-words (BOW) representation has been reported to achieve promising results in different image applications, its visual codebook is completely learnt from low-level visual features using quantization techniques and thus the so-called semantic gap remains unbridgeable. To handle such challenging issue, we utilize the annotations (predicted by algorithms or shared by users) of all the images to improve the original visual BOW representation. This is further formulated as a sparse coding problem so that the noise issue induced by the inaccurate quantization of visual features can also be handled to some extent. By developing an efficient sparse coding algorithm, we successfully generate a new visual BOW representation for image applications. Since such sparse coding has actually incorporated the high-level semantic information into the original visual codebook, we thus consider it as semantic sparse recoding of the visual content. Finally, we apply our semantic sparse recoding method to automatic image annotation and social image classification. The experimental results on several benchmark datasets show the promising performance of our semantic sparse recoding method in these two image applications.
Keywords :
image classification; image coding; image representation; quantisation (signal); annotation utilization; automatic image annotation; descriptive visual content generation; image applications; low-level visual features; quantization techniques; robust visual content representation generation; semantic gap; semantic sparse recoding method; social image classification; visual BOW representation; visual bag-of-words representation; visual codebook; Encoding; Image coding; Laplace equations; Optimization; Quantization (signal); Semantics; Visualization; Visual BOW representation; automatic image annotation; semantic gap; semantic sparse recoding; social image classification; sparse coding;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2014.2375641
Filename :
6967791
Link To Document :
بازگشت