DocumentCode :
2822821
Title :
Accurate depth map estimation from video via MRF optimization
Author :
Tseng, Sheng-Po ; Lai, Shang-Hong
Author_Institution :
Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
fYear :
2011
fDate :
6-9 Nov. 2011
Firstpage :
1
Lastpage :
4
Abstract :
In this paper, we propose a novel system to estimate depth maps of outdoor scenes from a video sequence. According to the characteristics of a video, our approach considers more information in the temporal domain than the traditional depth reconstruction methods. We perform Structure From Motion (SfM) on consecutive image frames from a video from SIFT feature point correspondences, which provides some camera information, including 3D translation and rotation, for all the images. Then, we compute the constrained optical flow between selected scenes so that we can solve an over-constrained linear system to estimate the depth map for all pixels at each frame. In addition, mean shift image segmentation is incorporated to aggregate the depth estimation. Thus, this initial depth map is used as the data term of our pixel-based and region-based Markov Random Field (MRF) formulation for depth map estimation. The proposed MRF depth estimation not only imposes adaptive smoothness constraints but also includes sky detection in the final depth map estimation. By minimizing the associated MRF energy function for each frame, we obtain refined depth maps that achieve detail-preserving and temporally consistent depth estimation results.
Keywords :
Markov processes; adaptive signal processing; image motion analysis; image segmentation; image sequences; random processes; smoothing methods; transforms; video signal processing; 3D rotation; 3D translation; Markov random field optimization; SIFT feature point correspondences; adaptive smoothness constraints; constrained optical flow; depth map estimation; mean shift image segmentation; over-constrained linear system; pixel-based Markov random field formulation; region-based Markov random field formulation; sky detection; structure from motion; temporal domain; video sequence; Computer vision; Estimation; Image motion analysis; Optical imaging; Optimization; Three dimensional displays; Video sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Visual Communications and Image Processing (VCIP), 2011 IEEE
Conference_Location :
Tainan
Print_ISBN :
978-1-4577-1321-7
Electronic_ISBN :
978-1-4577-1320-0
Type :
conf
DOI :
10.1109/VCIP.2011.6116005
Filename :
6116005
Link To Document :
بازگشت