Title :
Learning multiple pooling combination for image classification
Author :
Hu, Junlin ; Guo, Ping
Author_Institution :
Image Process. & Pattern Recognition Lab., Beijing Normal Univ., Beijing, China
Abstract :
Recently sparse coding with spatial pyramid matching method has shown its excellent performance in image classification. Inspired by this technique, we present an image classification approach by learning the optimal Multiple Pooling Combination strategy based on Non-Negative Sparse Coding (MPC-NNSC) in this paper. First, non-negative sparse coding with three different pooling methods as well as spatial pyramid matching method are utilized to encode local descriptors for image representation, respectively. Then a promising weight learning approach is employed to find a set of optimal weights for best fusing all these pooling methods in different scales. Lastly, support vector machine classifier with linear and histogram intersection kernel is employed for the final classification task. Experiments on two popular benchmark datasets are presented and they demonstrate the better performance of the proposed scheme compared to the state-of-the-art methods.
Keywords :
image classification; image matching; image representation; learning (artificial intelligence); support vector machines; MPC-NNSC; histogram intersection kernel; image classification; image representation; linear intersection kernel; local descriptor encoding; nonnegative sparse coding; optimal multiple pooling combination strategy learning; spatial pyramid matching method; support vector machine classifier; weight learning approach; Encoding; Equations; Image coding; Image representation; Kernel; Quantization; Visualization;
Conference_Titel :
Neural Networks (IJCNN), The 2012 International Joint Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-1488-6
Electronic_ISBN :
2161-4393
DOI :
10.1109/IJCNN.2012.6252840