Title :
Fusion of dense spatial features and sparse temporal features for three-dimensional structure estimation in urban scenes
Author :
Nawaf, Mohamad Motasem ; TreÌmeau, Alain
Author_Institution :
Lab. Hubert Curien, Univ. Jean Monnet, St. Étienne, France
Abstract :
The authors present a novel approach to improve three-dimensional (3D) structure estimation from an image stream in urban scenes. The authors consider a particular setup, where the camera is installed on a moving vehicle. Applying traditional structure from motion (SfM) technique in this case generates poor estimation of the 3D structure because of several reasons such as texture-less images, small baseline variations and dominant forward camera motion. The authors idea is to introduce the monocular depth cues that exist in a single image, and add time constraints on the estimated 3D structure. The scene is modelled as a set of small planar patches obtained using over-segmentation, and the goal is to estimate the 3D positioning of these planes. The authors propose a fusion scheme that employs Markov random field model to integrate spatial and temporal depth features. Spatial depth is obtained by learning a set of global and local image features. Temporal depth is obtained via sparse optical flow based SfM approach. That allows decreasing the estimation ambiguity by forcing some constraints on camera motion. Finally, the authors apply a fusion scheme to create unique 3D structure estimation.
Keywords :
Markov processes; cameras; estimation theory; image fusion; image segmentation; image sequences; random processes; 3D positioning; 3D structure estimation; Markov random field model; dense spatial feature fusion scheme; dominant forward camera motion; estimation ambiguity; image oversegmentation; image stream; monocular depth cues; moving vehicle; small baseline variations; small planar patches; sparse one temporal feature fusion scheme; sparse optical flow-based SfM approach; spatial depth features; structure from motion technique; temporal depth features; textureless image; three-dimensional structure estimation; urban scenes;
Journal_Title :
Computer Vision, IET
DOI :
10.1049/iet-cvi.2012.0270