DocumentCode :
2717572
Title :
RGB-(D) scene labeling: Features and algorithms
Author :
Ren, Xiaofeng ; Bo, Liefeng ; Fox, Dieter
fYear :
2012
fDate :
16-21 June 2012
Firstpage :
2759
Lastpage :
2766
Abstract :
Scene labeling research has mostly focused on outdoor scenes, leaving the harder case of indoor scenes poorly understood. Microsoft Kinect dramatically changed the landscape, showing great potentials for RGB-D perception (color+depth). Our main objective is to empirically understand the promises and challenges of scene labeling with RGB-D. We use the NYU Depth Dataset as collected and analyzed by Silberman and Fergus [30]. For RGB-D features, we adapt the framework of kernel descriptors that converts local similarities (kernels) to patch descriptors. For contextual modeling, we combine two lines of approaches, one using a superpixel MRF, and the other using a segmentation tree. We find that (1) kernel descriptors are very effective in capturing appearance (RGB) and shape (D) similarities; (2) both superpixel MRF and segmentation tree are useful in modeling context; and (3) the key to labeling accuracy is the ability to efficiently train and test with large-scale data. We improve labeling accuracy on the NYU Dataset from 56.6% to 76.1%. We also apply our approach to image-only scene labeling and improve the accuracy on the Stanford Background Dataset from 79.4% to 82.9%.
Keywords :
image colour analysis; image segmentation; trees (mathematics); Microsoft Kinect; RGB-(D) scene labeling; RGB-D features; RGB-D perception; contextual modeling; indoor scenes; kernel descriptors; local similarities; outdoor scenes; patch descriptors; scene labeling research; segmentation tree; superpixel MRF; Accuracy; Context modeling; Image color analysis; Image segmentation; Kernel; Labeling; Vegetation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location :
Providence, RI
ISSN :
1063-6919
Print_ISBN :
978-1-4673-1226-4
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2012.6247999
Filename :
6247999
Link To Document :
بازگشت