DocumentCode :
2719283
Title :
A codebook-free and annotation-free approach for fine-grained image categorization
Author :
Yao, Bangpeng ; Bradski, Gary ; Fei-Fei, Li
Author_Institution :
Comput. Sci. Dept., Stanford Univ., Stanford, CA, USA
fYear :
2012
fDate :
16-21 June 2012
Firstpage :
3466
Lastpage :
3473
Abstract :
Fine-grained categorization refers to the task of classifying objects that belong to the same basic-level class (e.g. different bird species) and share similar shape or visual appearances. Most of the state-of-the-art basic-level object classification algorithms have difficulties in this challenging problem. One reason for this can be attributed to the popular codebook-based image representation, often resulting in loss of subtle image information that are critical for fine-grained classification. Another way to address this problem is to introduce human annotations of object attributes or key points, a tedious process that is also difficult to generalize to new tasks. In this work, we propose a codebook-free and annotation-free approach for fine-grained image categorization. Instead of using vector-quantized codewords, we obtain an image representation by running a high throughput template matching process using a large number of randomly generated image templates. We then propose a novel bagging-based algorithm to build a final classifier by aggregating a set of discriminative yet largely uncorrelated classifiers. Experimental results show that our method outperforms state-of-the-art classification approaches on the Caltech-UCSD Birds dataset.
Keywords :
image classification; image matching; image representation; Caltech-UCSD Birds dataset; annotation-free approach; bagging-based algorithm; basic-level class; basic-level object classification algorithms; codebook-based image representation; codebook-free approach; final classifier; fine-grained classification; fine-grained image categorization; human annotations; image template generation; key points; object attributes; template matching process; Birds; Humans; Image color analysis; Image representation; Training; Vectors; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location :
Providence, RI
ISSN :
1063-6919
Print_ISBN :
978-1-4673-1226-4
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2012.6248088
Filename :
6248088
Link To Document :
بازگشت