DocumentCode :
3405081
Title :
Supervised translation-invariant sparse coding
Author :
Yang, Jianchao ; Yu, Kai ; Huang, Thomas
Author_Institution :
Beckman Inst., Univ. of Illinois at Urbana-Champaign, Champaign, IL, USA
fYear :
2010
fDate :
13-18 June 2010
Firstpage :
3517
Lastpage :
3524
Abstract :
In this paper, we propose a novel supervised hierarchical sparse coding model based on local image descriptors for classification tasks. The supervised dictionary training is performed via back-projection, by minimizing the training error of classifying the image level features, which are extracted by max pooling over the sparse codes within a spatial pyramid. Such a max pooling procedure across multiple spatial scales offer the model translation invariant properties, similar to the Convolutional Neural Network (CNN). Experiments show that our supervised dictionary improves the performance of the proposed model significantly over the unsupervised dictionary, leading to state-of-the-art performance on diverse image databases. Further more, our supervised model targets learning linear features, implying its great potential in handling large scale datasets in real applications.
Keywords :
image classification; image coding; neural nets; backprojection; classification tasks; convolutional neural network; image level features; linear features learning; local image descriptors; max pooling procedure; sparse codes; spatial pyramid; supervised dictionary training; supervised hierarchical sparse coding model; supervised translation-invariant sparse coding; training error; unsupervised dictionary; Convolution; Convolutional codes; Dictionaries; Feature extraction; Image classification; Image coding; Image reconstruction; Predictive models; Support vector machine classification; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
Conference_Location :
San Francisco, CA
ISSN :
1063-6919
Print_ISBN :
978-1-4244-6984-0
Type :
conf
DOI :
10.1109/CVPR.2010.5539958
Filename :
5539958
Link To Document :
بازگشت