DocumentCode :
3328876
Title :
Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images
Author :
Gupta, Swastik ; Arbelaez, Pablo ; Malik, Jagannath
Author_Institution :
Univ. of California, Berkeley, Berkeley, CA, USA
fYear :
2013
fDate :
23-28 June 2013
Firstpage :
564
Lastpage :
571
Abstract :
We address the problems of contour detection, bottom-up grouping and semantic segmentation using RGB-D data. We focus on the challenging setting of cluttered indoor scenes, and evaluate our approach on the recently introduced NYU-Depth V2 (NYUD2) dataset [27]. We propose algorithms for object boundary detection and hierarchical segmentation that generalize the gPb-ucm approach of [2] by making effective use of depth information. We show that our system can label each contour with its type (depth, normal or albedo). We also propose a generic method for long-range amodal completion of surfaces and show its effectiveness in grouping. We then turn to the problem of semantic segmentation and propose a simple approach that classifies super pixels into the 40 dominant object categories in NYUD2. We use both generic and class-specific features to encode the appearance and geometry of objects. We also show how our approach can be used for scene classification, and how this contextual information in turn improves object recognition. In all of these tasks, we report significant improvements over the state-of-the-art.
Keywords :
feature extraction; geometry; image classification; image coding; image colour analysis; image resolution; image segmentation; object detection; object recognition; NYU-Depth V2 dataset; NYUD2 dataset; RGB-D data; RGB-D images; class-specific features; cluttered indoor scenes; contour detection problems; gPb-ucm approach; generic method; hierarchical segmentation; indoor scene recognition; long-range amodal surface completion; object appearance encoding; object boundary detection; object categories; object geometry encoding; object recognition; perceptual organization; semantic segmentation; semantic segmentation problem; superpixel classification approach; Benchmark testing; Gravity; Image color analysis; Image segmentation; Semantics; Shape; Three-dimensional displays; RGBD Recognition; RGBD Segmentation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on
Conference_Location :
Portland, OR
ISSN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2013.79
Filename :
6618923
Link To Document :
بازگشت