DocumentCode :
3006458
Title :
Linear spatial pyramid matching using sparse coding for image classification
Author :
Jianchao Yang ; Kai Yu ; Yihong Gong ; Huang, Tingwen
Author_Institution :
Beckman Inst., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear :
2009
fDate :
20-25 June 2009
Firstpage :
1794
Lastpage :
1801
Abstract :
Recently SVMs using spatial pyramid matching (SPM) kernel have been highly successful in image classification. Despite its popularity, these nonlinear SVMs have a complexity O(n2 ~ n3) in training and O(n) in testing, where n is the training size, implying that it is nontrivial to scaleup the algorithms to handle more than thousands of training images. In this paper we develop an extension of the SPM method, by generalizing vector quantization to sparse coding followed by multi-scale spatial max pooling, and propose a linear SPM kernel based on SIFT sparse codes. This new approach remarkably reduces the complexity of SVMs to O(n) in training and a constant in testing. In a number of image categorization experiments, we find that, in terms of classification accuracy, the suggested linear SPM based on sparse coding of SIFT descriptors always significantly outperforms the linear SPM kernel on histograms, and is even better than the nonlinear SPM kernels, leading to state-of-the-art performance on several benchmarks by using a single type of descriptors.
Keywords :
computational complexity; image classification; image matching; support vector machines; vector quantisation; SIFT descriptor; SIFT sparse codes; SPM kernel; computational complexity; image categorization; image classification; linear spatial pyramid matching; multiscale spatial max pooling; nonlinear SVM; sparse coding; training images; vector quantization; Computational complexity; Histograms; Image classification; Image coding; Image representation; Image segmentation; Kernel; Scanning probe microscopy; Testing; Vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on
Conference_Location :
Miami, FL
ISSN :
1063-6919
Print_ISBN :
978-1-4244-3992-8
Type :
conf
DOI :
10.1109/CVPR.2009.5206757
Filename :
5206757
Link To Document :
بازگشت