DocumentCode :
2508154
Title :
Three-layer Spatial Sparse Coding for Image Classification
Author :
Dai, Dengxin ; Yang, Wen ; Wu, Tianfu
Author_Institution :
Signal Process. Lab., Wuhan Univ., Wuhan, China
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
613
Lastpage :
616
Abstract :
In this paper, we propose a three-layer spatial sparse coding (TSSC) for image classification, aiming at three objectives: naturally recognizing image categories without learning phase, naturally involving spatial configurations of images, and naturally counteracting the intra-class variances. The method begins by representing the test images in a spatial pyramid as the to-be-recovered signals, and taking all sampled image patches at multiple scales from the labeled images as the bases. Then, three sets of coefficients are involved into the cardinal sparse coding to get the TSSC, one to penalize spatial inconsistencies of the pyramid cells and the corresponding selected bases, one to guarantee the sparsity of selected images, and the other to guarantee the sparsity of selected categories. Finally, the test images are classified according to a simple image-to-category similarity defined on the coding coefficients. In experiments, we test our method on two publicly available datasets and achieve significantly more accurate results than the conventional sparse coding with only a modest increase in computational complexity.
Keywords :
image classification; image coding; TSSC; image classification; image-to-category similarity; spatial configuration; spatial pyramid; three-layer spatial sparse coding; Artificial neural networks; Encoding; Image coding; Image reconstruction; Minimization; Pixel; Visualization; image classification; sparse coding; three-layer spatial sparse coding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.155
Filename :
5597454
Link To Document :
بازگشت