مرکز منطقه ای اطلاع رساني علوم و فناوري - Large-scale knowledge transfer for object localization in ImageNet

DocumentCode :

2718662

Title :

Large-scale knowledge transfer for object localization in ImageNet

Author :

Guillaumin, Matthieu ; Ferrari, Vittorio

Author_Institution :

ETH Zurich, Zurich, Switzerland

fYear :

2012

fDate :

16-21 June 2012

Firstpage :

3202

Lastpage :

3209

Abstract :

ImageNet is a large-scale database of object classes with millions of images. Unfortunately only a small fraction of them is manually annotated with bounding-boxes. This prevents useful developments, such as learning reliable object detectors for thousands of classes. In this paper we propose to automatically populate ImageNet with many more bounding-boxes, by leveraging existing manual annotations. The key idea is to localize objects of a target class for which annotations are not available, by transferring knowledge from related source classes with available annotations. We distinguish two kinds of source classes: ancestors and siblings. Each source provides knowledge about the plausible location, appearance and context of the target objects, which induces a probability distribution over windows in images of the target class. We learn to combine these distributions so as to maximize the location accuracy of the most probable window. Finally, we employ the combined distribution in a procedure to jointly localize objects in all images of the target class. Through experiments on 0.5 million images from 219 classes we show that our technique (i) annotates a wide range of classes with bounding-boxes; (ii) effectively exploits the hierarchical structure of ImageNet, since all sources and types of knowledge we propose contribute to the results; (iii) scales efficiently.

Keywords :

knowledge management; object detection; statistical distributions; visual databases; ImageNet; ancestor source class; bounding-boxes; large-scale database; large-scale knowledge transfer; object class; object localization; probability distribution; sibling source class; target object appearance; target object context; target object location; Airplanes; Context; Prototypes; Support vector machines; Training; Vehicles; Visualization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on

Conference_Location :

Providence, RI

ISSN :

1063-6919

Print_ISBN :

978-1-4673-1226-4

Electronic_ISBN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2012.6248055

Filename :

6248055

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2718662