DocumentCode :
21353
Title :
Spatial Pooling of Heterogeneous Features for Image Classification
Author :
Lingxi Xie ; Qi Tian ; Meng Wang ; Bo Zhang
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Volume :
23
Issue :
5
fYear :
2014
fDate :
May-14
Firstpage :
1994
Lastpage :
2008
Abstract :
In image classification tasks, one of the most successful algorithms is the bag-of-features (BoFs) model. Although the BoF model has many advantages, such as simplicity, generality, and scalability, it still suffers from several drawbacks, including the limited semantic description of local descriptors, lack of robust structures upon single visual words, and missing of efficient spatial weighting. To overcome these shortcomings, various techniques have been proposed, such as extracting multiple descriptors, spatial context modeling, and interest region detection. Though they have been proven to improve the BoF model to some extent, there still lacks a coherent scheme to integrate each individual module together. To address the problems above, we propose a novel framework with spatial pooling of complementary features. Our model expands the traditional BoF model on three aspects. First, we propose a new scheme for combining texture and edge-based local features together at the descriptor extraction level. Next, we build geometric visual phrases to model spatial context upon complementary features for midlevel image representation. Finally, based on a smoothed edgemap, a simple and effective spatial weighting scheme is performed to capture the image saliency. We test the proposed framework on several benchmark data sets for image classification. The extensive results show the superior performance of our algorithm over the state-of-the-art methods.
Keywords :
feature extraction; image classification; image representation; image texture; vocabulary; BoF model; bag-of-features model; descriptor extraction level; edge-based local features; geometric visual phrases; heterogeneous features; image classification tasks; image saliency; local descriptors; midlevel image representation; semantic description; single visual words; spatial context modeling; spatial pooling; spatial weighting scheme; Accuracy; Feature extraction; Image edge detection; Quantization (signal); Shape; Vectors; Visualization; BoF model; Image classification; complementary descriptors; geometric phrases pooling; spatial weighting;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2014.2310117
Filename :
6757045
Link To Document :
بازگشت