DocumentCode :
3019366
Title :
Semantic structure from motion with object and point interactions
Author :
Bao, Sid Yingze ; Bagra, Mohit ; Savarese, Silvio
Author_Institution :
Univ. of Michigan at Ann Arbor, Ann Arbor, MI, USA
fYear :
2011
fDate :
6-13 Nov. 2011
Firstpage :
982
Lastpage :
989
Abstract :
We propose a new method for jointly detecting objects and recovering the geometry of the scene (camera pose, object and scene point 3D locations) from multiple semi-calibrated images (camera internal parameters are known). To achieve this task, our method models high level semantics (i.e. object class labels and relevant characteristics such as location and pose) and the interaction (correlations) of objects and feature points within the same view and across views. We validate our algorithm against state-of-the-art baseline methods using two public datasets - Ford Car dataset and Kinect Office dataset [1] - and show that we: i) significantly improve the camera pose estimation results compared to point-based SFM algorithm; ii) achieve better 2D and 3D object detection accuracy than using single images separately. Our algorithm is critical in many application scenarios including object manipulation and autonomous navigation.
Keywords :
image motion analysis; image reconstruction; object detection; pose estimation; 2D object detection; 3D object detection; Ford Car dataset; Kinect Office dataset; camera pose estimation; multiple semicalibrated images; point-based SFM algorithm; scene geometry recovery; semantic structure; structure from motion; Cameras; Correlation; Feature extraction; Object detection; Semantics; Three dimensional displays; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4673-0062-9
Type :
conf
DOI :
10.1109/ICCVW.2011.6130358
Filename :
6130358
Link To Document :
بازگشت