Title :
Sensor fusion for semantic segmentation of urban scenes
Author :
Zhang, Richard ; Candra, Stefan A. ; Vetter, Kai ; Zakhor, Avideh
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Univ. of California, Berkeley, Berkeley, CA, USA
Abstract :
Semantic understanding of environments is an important problem in robotics in general and intelligent autonomous systems in particular. In this paper, we propose a semantic segmentation algorithm which effectively fuses information from images and 3D point clouds. The proposed method incorporates information from multiple scales in an intuitive and effective manner. A late-fusion architecture is proposed to maximally leverage the training data in each modality. Finally, a pairwise Conditional Random Field (CRF) is used as a post-processing step to enforce spatial consistency in the structured prediction. The proposed algorithm is evaluated on the publicly available KITTI dataset [1] [2], augmented with additional pixel and point-wise semantic labels for building, sky, road, vegetation, sidewalk, car, pedestrian, cyclist, sign/pole, and fence regions. A per-pixel accuracy of 89.3% and average class accuracy of 65.4% is achieved, well above current state-of-the-art [3].
Keywords :
image fusion; image segmentation; 3D point clouds; KITTI dataset; conditional random field; late-fusion architecture; pairwise CRF; point-wise semantic labels; sensor fusion; urban scenes semantic segmentation; Feature extraction; Image segmentation; Labeling; Semantics; Three-dimensional displays; Training; Training data;
Conference_Titel :
Robotics and Automation (ICRA), 2015 IEEE International Conference on
Conference_Location :
Seattle, WA
DOI :
10.1109/ICRA.2015.7139439