Title :
Supervised translation-invariant sparse coding
Author :
Yang, Jianchao ; Yu, Kai ; Huang, Thomas
Author_Institution :
Beckman Inst., Univ. of Illinois at Urbana-Champaign, Champaign, IL, USA
Abstract :
In this paper, we propose a novel supervised hierarchical sparse coding model based on local image descriptors for classification tasks. The supervised dictionary training is performed via back-projection, by minimizing the training error of classifying the image level features, which are extracted by max pooling over the sparse codes within a spatial pyramid. Such a max pooling procedure across multiple spatial scales offer the model translation invariant properties, similar to the Convolutional Neural Network (CNN). Experiments show that our supervised dictionary improves the performance of the proposed model significantly over the unsupervised dictionary, leading to state-of-the-art performance on diverse image databases. Further more, our supervised model targets learning linear features, implying its great potential in handling large scale datasets in real applications.
Keywords :
image classification; image coding; neural nets; backprojection; classification tasks; convolutional neural network; image level features; linear features learning; local image descriptors; max pooling procedure; sparse codes; spatial pyramid; supervised dictionary training; supervised hierarchical sparse coding model; supervised translation-invariant sparse coding; training error; unsupervised dictionary; Convolution; Convolutional codes; Dictionaries; Feature extraction; Image classification; Image coding; Image reconstruction; Predictive models; Support vector machine classification; Support vector machines;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
978-1-4244-6984-0
DOI :
10.1109/CVPR.2010.5539958