DocumentCode :
3424757
Title :
Predicting an Object Location Using a Global Image Representation
Author :
Serrano, Jose A. Rodriguez ; Larlus, Diane
fYear :
2013
fDate :
1-8 Dec. 2013
Firstpage :
1729
Lastpage :
1736
Abstract :
We tackle the detection of prominent objects in images as a retrieval task: given a global image descriptor, we find the most similar images in an annotated dataset, and transfer the object bounding boxes. We refer to this approach as data driven detection (DDD), that is an alternative to sliding windows. Previous works have used similar notions but with task-independent similarities and representations, i.e. they were not tailored to the end-goal of localization. This article proposes two contributions: (i) a metric learning algorithm and (ii) a representation of images as object probability maps, that are both optimized for detection. We show experimentally that these two contributions are crucial to DDD, do not require costly additional operations, and in some cases yield comparable or better results than state-of-the-art detectors despite conceptual simplicity and increased speed. As an application of prominent object detection, we improve fine-grained categorization by precropping images with the proposed approach.
Keywords :
image representation; image retrieval; object detection; probability; DDD; data driven detection; fine-grained categorization; global image representation; image retrieval; metric learning algorithm; object bounding boxes; object detection; object location prediction; object probability map; precropping images; Databases; Feature extraction; Image representation; Image segmentation; Measurement; Training; Vectors; Fisher vectors; fine-grained categorization; image retrieval; metric learning; object detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision (ICCV), 2013 IEEE International Conference on
Conference_Location :
Sydney, NSW
ISSN :
1550-5499
Type :
conf
DOI :
10.1109/ICCV.2013.217
Filename :
6751325
Link To Document :
بازگشت