Title :
Predicting an Object Location Using a Global Image Representation
Author :
Serrano, Jose A. Rodriguez ; Larlus, Diane
Abstract :
We tackle the detection of prominent objects in images as a retrieval task: given a global image descriptor, we find the most similar images in an annotated dataset, and transfer the object bounding boxes. We refer to this approach as data driven detection (DDD), that is an alternative to sliding windows. Previous works have used similar notions but with task-independent similarities and representations, i.e. they were not tailored to the end-goal of localization. This article proposes two contributions: (i) a metric learning algorithm and (ii) a representation of images as object probability maps, that are both optimized for detection. We show experimentally that these two contributions are crucial to DDD, do not require costly additional operations, and in some cases yield comparable or better results than state-of-the-art detectors despite conceptual simplicity and increased speed. As an application of prominent object detection, we improve fine-grained categorization by precropping images with the proposed approach.
Keywords :
image representation; image retrieval; object detection; probability; DDD; data driven detection; fine-grained categorization; global image representation; image retrieval; metric learning algorithm; object bounding boxes; object detection; object location prediction; object probability map; precropping images; Databases; Feature extraction; Image representation; Image segmentation; Measurement; Training; Vectors; Fisher vectors; fine-grained categorization; image retrieval; metric learning; object detection;
Conference_Titel :
Computer Vision (ICCV), 2013 IEEE International Conference on
Conference_Location :
Sydney, NSW
DOI :
10.1109/ICCV.2013.217