DocumentCode :
3332626
Title :
Mesh Based Semantic Modelling for Indoor and Outdoor Scenes
Author :
Valentin, Julien P. C. ; Sengupta, Sabyasachi ; Warrell, J. ; Shahrokni, Ali ; Torr, Philip H. S.
fYear :
2013
fDate :
23-28 June 2013
Firstpage :
2067
Lastpage :
2074
Abstract :
Semantic reconstruction of a scene is important for a variety of applications such as 3D modelling, object recognition and autonomous robotic navigation. However, most object labelling methods work in the image domain and fail to capture the information present in 3D space. In this work we propose a principled way to generate object labelling in 3D. Our method builds a triangulated meshed representation of the scene from multiple depth estimates. We then define a CRF over this mesh, which is able to capture the consistency of geometric properties of the objects present in the scene. In this framework, we are able to generate object hypotheses by combining information from multiple sources: geometric properties (from the 3D mesh), and appearance properties (from images). We demonstrate the robustness of our framework in both indoor and outdoor scenes. For indoor scenes we created an augmented version of the NYU indoor scene dataset (RGBD images) with object labelled meshes for training and evaluation. For outdoor scenes, we created ground truth object labellings for the KITTY odometry dataset (stereo image sequence). We observe a significant speed-up in the inference stage by performing labelling on the mesh, and additionally achieve higher accuracies.
Keywords :
image colour analysis; image reconstruction; image representation; image sequences; solid modelling; stereo image processing; 3D mesh; 3D space; CRF; KITTY odometry dataset; NYU indoor scene dataset; RGBD images; appearance properties; geometric properties; ground truth object labellings; indoor scenes; inference stage; mesh based semantic modelling; object hypothesis generation; object labelling methods; outdoor scenes; semantic reconstruction; stereo image sequence; triangulated meshed representation; Cameras; Feature extraction; Image reconstruction; Labeling; Semantics; Three-dimensional displays; Training; 3d scene understanding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on
Conference_Location :
Portland, OR
ISSN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2013.269
Filename :
6619113
Link To Document :
بازگشت