Title :
Semantic structure from motion with object and point interactions
Author :
Bao, Sid Yingze ; Bagra, Mohit ; Savarese, Silvio
Author_Institution :
Univ. of Michigan at Ann Arbor, Ann Arbor, MI, USA
Abstract :
We propose a new method for jointly detecting objects and recovering the geometry of the scene (camera pose, object and scene point 3D locations) from multiple semi-calibrated images (camera internal parameters are known). To achieve this task, our method models high level semantics (i.e. object class labels and relevant characteristics such as location and pose) and the interaction (correlations) of objects and feature points within the same view and across views. We validate our algorithm against state-of-the-art baseline methods using two public datasets - Ford Car dataset and Kinect Office dataset [1] - and show that we: i) significantly improve the camera pose estimation results compared to point-based SFM algorithm; ii) achieve better 2D and 3D object detection accuracy than using single images separately. Our algorithm is critical in many application scenarios including object manipulation and autonomous navigation.
Keywords :
image motion analysis; image reconstruction; object detection; pose estimation; 2D object detection; 3D object detection; Ford Car dataset; Kinect Office dataset; camera pose estimation; multiple semicalibrated images; point-based SFM algorithm; scene geometry recovery; semantic structure; structure from motion; Cameras; Correlation; Feature extraction; Object detection; Semantics; Three dimensional displays; Training;
Conference_Titel :
Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4673-0062-9
DOI :
10.1109/ICCVW.2011.6130358