DocumentCode :
3672379
Title :
Towards 3D object detection with bimodal deep Boltzmann machines over RGBD imagery
Author :
Wei Liu;Rongrong Ji; Shaozi Li
Author_Institution :
Dep. of Cognitive Science, School of Info. Science and Eng., Xiamen University, China
fYear :
2015
fDate :
6/1/2015 12:00:00 AM
Firstpage :
3013
Lastpage :
3021
Abstract :
Nowadays, detecting objects in 3D scenes like point clouds has become an emerging challenge with various applications. However, it retains as an open problem due to the deficiency of labeling 3D training data. To deploy an accurate detection algorithm typically resorts to investigating both RGB and depth modalities, which have distinct statistics while correlated with each other. Previous research mainly focus on detecting objects using only one modality, which ignores exploiting the cross-modality cues. In this work, we propose a cross-modality deep learning framework based on deep Boltzmann Machines for 3D Scenes object detection. In particular, we demonstrate that by learning cross-modality feature from RGBD data, it is possible to capture their joint information to reinforce detector trainings in individual modalities. In particular, we slide a 3D detection window in the 3D point cloud to match the exemplar shape, which the lack of training data in 3D domain is conquered via (1) We collect 3D CAD models and 2D positive samples from Internet. (2) adopt pretrained R-CNNs [2] to extract raw feature from both RGB and Depth domains. Experiments on RMRC dataset demonstrate that the bimodal based deep feature learning framework helps 3D scene object detection.
Keywords :
"Three-dimensional displays","Solid modeling","Training","Frequency modulation","Joints","Detectors","Feature extraction"
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2015.7298920
Filename :
7298920
Link To Document :
بازگشت