DocumentCode
3328876
Title
Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images
Author
Gupta, Swastik ; Arbelaez, Pablo ; Malik, Jagannath
Author_Institution
Univ. of California, Berkeley, Berkeley, CA, USA
fYear
2013
fDate
23-28 June 2013
Firstpage
564
Lastpage
571
Abstract
We address the problems of contour detection, bottom-up grouping and semantic segmentation using RGB-D data. We focus on the challenging setting of cluttered indoor scenes, and evaluate our approach on the recently introduced NYU-Depth V2 (NYUD2) dataset [27]. We propose algorithms for object boundary detection and hierarchical segmentation that generalize the gPb-ucm approach of [2] by making effective use of depth information. We show that our system can label each contour with its type (depth, normal or albedo). We also propose a generic method for long-range amodal completion of surfaces and show its effectiveness in grouping. We then turn to the problem of semantic segmentation and propose a simple approach that classifies super pixels into the 40 dominant object categories in NYUD2. We use both generic and class-specific features to encode the appearance and geometry of objects. We also show how our approach can be used for scene classification, and how this contextual information in turn improves object recognition. In all of these tasks, we report significant improvements over the state-of-the-art.
Keywords
feature extraction; geometry; image classification; image coding; image colour analysis; image resolution; image segmentation; object detection; object recognition; NYU-Depth V2 dataset; NYUD2 dataset; RGB-D data; RGB-D images; class-specific features; cluttered indoor scenes; contour detection problems; gPb-ucm approach; generic method; hierarchical segmentation; indoor scene recognition; long-range amodal surface completion; object appearance encoding; object boundary detection; object categories; object geometry encoding; object recognition; perceptual organization; semantic segmentation; semantic segmentation problem; superpixel classification approach; Benchmark testing; Gravity; Image color analysis; Image segmentation; Semantics; Shape; Three-dimensional displays; RGBD Recognition; RGBD Segmentation;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on
Conference_Location
Portland, OR
ISSN
1063-6919
Type
conf
DOI
10.1109/CVPR.2013.79
Filename
6618923
Link To Document