DocumentCode :
3410199
Title :
Toward coherent object detection and scene layout understanding
Author :
Bao, Sid Ying-Ze ; Sun, Min ; Savarese, Silvio
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Michigan at Ann Arbor, Ann Arbor, MI, USA
fYear :
2010
fDate :
13-18 June 2010
Firstpage :
65
Lastpage :
72
Abstract :
Detecting objects in complex scenes while recovering the scene layout is a critical functionality in many vision-based applications. Inspired by the work of, we advocate the importance of geometric contextual reasoning for object recognition. We start from the intuition that objects´ location and pose in the 3D space are not arbitrarily distributed but rather constrained by the fact that objects must lie on one or multiple supporting surfaces. We model such supporting surfaces by means of hidden parameters (i.e. not explicitly observed) and formulate the problem of joint scene reconstruction and object recognition as the one of finding the set of parameters that maximizes the joint probability of having a number of detected objects on K supporting planes given the observations. As a key ingredient for solving this optimization problem, we have demonstrated a novel relationship between object location and pose in the image, and the scene layout parameters (i.e. normal of one or more supporting planes in 3D and camera pose, location and focal length). Using the probabilistic formulation and the above relationship our method has the unique ability to jointly: (i) reduce false alarm and false negative object detection rate; (ii) recover object location and supporting planes within the 3D camera reference system; (iii) infer camera parameters (view point and the focal length) from just one single uncalibrated image. Quantitative and qualitative experimental evaluation on a number of datasets (a novel in-house dataset and label-me on car and pedestrian) demonstrates our theoretical claims.
Keywords :
cameras; computer graphics; object detection; object recognition; optimisation; pose estimation; 3D camera reference system; 3D space; K supporting planes; camera parameters; coherent object detection; geometric contextual reasoning; inhouse dataset; joint scene reconstruction; multiple supporting surface; object recognition; optimization problem; pose estimation; scene layout parameter; uncalibrated image; vision-based application; Application software; Cameras; Computer vision; Detectors; Humans; Image reconstruction; Layout; Object detection; Object recognition; Surface reconstruction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
Conference_Location :
San Francisco, CA
ISSN :
1063-6919
Print_ISBN :
978-1-4244-6984-0
Type :
conf
DOI :
10.1109/CVPR.2010.5540229
Filename :
5540229
Link To Document :
بازگشت