DocumentCode :
729732
Title :
Learning class-specific pooling shapes for image classification
Author :
Jinzhuo Wang ; Wenmin Wang ; Ronggang Wang ; Wen Gao
Author_Institution :
Shenzhen Grad. Sch., Digital Media R&D Center, Peking Univ., Shenzhen, China
fYear :
2015
fDate :
June 29 2015-July 3 2015
Firstpage :
1
Lastpage :
6
Abstract :
Spatial pyramid (SP) representation is an extension of bag-of-feature model which embeds spatial layout information of local features by pooling feature codes over pre-defined spatial shapes. However, the uniform style of spatial pooling shapes used in standard SP is an ad-hoc manner without theoretical motivation, thus lacking the generalization power to adapt to different distribution of geometric properties across image classes. In this paper, we propose a data-driven approach to adaptively learn class-specific pooling shapes (CSPS). Specifically, we first establish an over-complete set of spatial shapes providing candidates with more flexible geometric patterns. Then the optimal subset for each class is selected by training a linear classifier with structured sparsity constraint and color distribution cues. To further enhance the robust of our model, the representations over CSPS are compressed according to the shape importance and finally fed to SVM with a multi-shape matching kernel for classification task. Experimental results on three challenging datasets (Caltech-256, Scene-15 and Indoor-67) demonstrate the effectiveness of the proposed method on both object and scene images.
Keywords :
feature extraction; generalisation (artificial intelligence); image classification; image colour analysis; image matching; learning (artificial intelligence); shape recognition; CSPS; Caltech-256; Indoor-67; SVM; Scene-15; bag-of-feature model; class-specific pooling shape adaptive learning; color distribution cues; feature code pooling; generalization; geometric pattern; geometric property distribution; image classification; linear classifier training; local feature spatial layout information; multishape matching kernel; object image; predefined spatial shapes; scene image; shape importance; spatial pyramid representation; structured sparsity constraint; Encoding; Image color analysis; Kernel; Layout; Shape; Standards; Training; Image classification; class-specific pooling shapes (CSPS); multi-shape matching kernel; representation compression;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2015 IEEE International Conference on
Conference_Location :
Turin
Type :
conf
DOI :
10.1109/ICME.2015.7177433
Filename :
7177433
Link To Document :
بازگشت