DocumentCode
2718662
Title
Large-scale knowledge transfer for object localization in ImageNet
Author
Guillaumin, Matthieu ; Ferrari, Vittorio
Author_Institution
ETH Zurich, Zurich, Switzerland
fYear
2012
fDate
16-21 June 2012
Firstpage
3202
Lastpage
3209
Abstract
ImageNet is a large-scale database of object classes with millions of images. Unfortunately only a small fraction of them is manually annotated with bounding-boxes. This prevents useful developments, such as learning reliable object detectors for thousands of classes. In this paper we propose to automatically populate ImageNet with many more bounding-boxes, by leveraging existing manual annotations. The key idea is to localize objects of a target class for which annotations are not available, by transferring knowledge from related source classes with available annotations. We distinguish two kinds of source classes: ancestors and siblings. Each source provides knowledge about the plausible location, appearance and context of the target objects, which induces a probability distribution over windows in images of the target class. We learn to combine these distributions so as to maximize the location accuracy of the most probable window. Finally, we employ the combined distribution in a procedure to jointly localize objects in all images of the target class. Through experiments on 0.5 million images from 219 classes we show that our technique (i) annotates a wide range of classes with bounding-boxes; (ii) effectively exploits the hierarchical structure of ImageNet, since all sources and types of knowledge we propose contribute to the results; (iii) scales efficiently.
Keywords
knowledge management; object detection; statistical distributions; visual databases; ImageNet; ancestor source class; bounding-boxes; large-scale database; large-scale knowledge transfer; object class; object localization; probability distribution; sibling source class; target object appearance; target object context; target object location; Airplanes; Context; Prototypes; Support vector machines; Training; Vehicles; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location
Providence, RI
ISSN
1063-6919
Print_ISBN
978-1-4673-1226-4
Electronic_ISBN
1063-6919
Type
conf
DOI
10.1109/CVPR.2012.6248055
Filename
6248055
Link To Document