DocumentCode
1433332
Title
Discovering Image Semantics in Codebook Derivative Space
Author
Wang, Jinjun ; Gong, Yihong
Author_Institution
Epson R&D, Inc., San Jose, CA, USA
Volume
14
Issue
4
fYear
2012
Firstpage
986
Lastpage
994
Abstract
The sparse coding based approaches for image recognition have recently shown improved performance than traditional bag-of-features technique. Due to high dimensionality of the image descriptor space, existing systems usually require very large codebook size to minimize coding error in order to get satisfactory accuracy. While most research efforts try to address the problem by constructing a relatively smaller codebook with stronger discriminative power, in this paper, we introduce an alternative solution by enhancing the quality of coding. Particularly, we apply the idea similar to Fisher kernel to the coding framework, where we use the image-dependent codebook derivative to represent the image. The proposed idea is generic across multiple coding criteria, and in this paper, it is applied to enhance the locality-constraint linear coding (LLC). Experiments show that, the extracted new feature, called “LLC+,” achieved significantly improved accuracy on several challenging datasets even with a small codebook of 1/20 the reported size used by LLC. This obviously adds to LLC+ the modeling accuracy, processing speed and codebook training advantages.
Keywords
feature extraction; image coding; image recognition; image representation; linear codes; Fisher kernel; LLC+; codebook derivative space; codebook training advantages; coding error minimization; coding quality enhancement; feature extraction; high image descriptor space dimensionality; image recognition; image representation; image semantics discovery; image-dependent codebook derivative; locality-constraint linear coding; modeling accuracy; multiple coding criteria; performance improvement; processing speed; sparse coding based approach; very large codebook size; Accuracy; Dictionaries; Encoding; Image coding; Image reconstruction; Image representation; Training; Codebook derivative; multiclass image classification; sparse-coding;
fLanguage
English
Journal_Title
Multimedia, IEEE Transactions on
Publisher
ieee
ISSN
1520-9210
Type
jour
DOI
10.1109/TMM.2012.2186120
Filename
6140978
Link To Document