Title :
Real-Time Visibility-Based Fusion of Depth Maps
Author :
Merrell, Paul ; Akbarzadeh, Amir ; Wang, Liang ; Mordohai, Philippos ; Frahm, Jan-Michael ; Yang, Ruigang ; Nistér, David ; Pollefeys, Marc
Author_Institution :
Univ. of North Carolina, Chapel Hill
Abstract :
We present a viewpoint-based approach for the quick fusion of multiple stereo depth maps. Our method selects depth estimates for each pixel that minimize violations of visibility constraints and thus remove errors and inconsistencies from the depth maps to produce a consistent surface. We advocate a two-stage process in which the first stage generates potentially noisy, overlapping depth maps from a set of calibrated images and the second stage fuses these depth maps to obtain an integrated surface with higher accuracy, suppressed noise, and reduced redundancy. We show that by dividing the processing into two stages we are able to achieve a very high throughput because we are able to use a computationally cheap stereo algorithm and because this architecture is amenable to hardware-accelerated (GPU) implementations. A rigorous formulation based on the notion of stability of a depth estimate is presented first. It aims to determine the validity of a depth estimate by rendering multiple depth maps into the reference view as well as rendering the reference depth map into the other views in order to detect occlusions and free- space violations. We also present an approximate alternative formulation that selects and validates only one hypothesis based on confidence. Both formulations enable us to perform video-based reconstruction at up to 25 frames per second. We show results on the multi-view stereo evaluation benchmark datasets and several outdoors video sequences. Extensive quantitative analysis is performed using an accurately surveyed model of a real building as ground truth.
Keywords :
hidden feature removal; image fusion; image reconstruction; rendering (computer graphics); stereo image processing; video signal processing; graphical processing unit; multiple stereo depth map rendering; occlusion detection; real-time visibility-based fusion; video-based 3D shape reconstruction; Computer architecture; Fuses; Fusion power generation; Image reconstruction; Noise generators; Noise reduction; Redundancy; Stability; Throughput; Video sequences;
Conference_Titel :
Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on
Conference_Location :
Rio de Janeiro
Print_ISBN :
978-1-4244-1630-1
Electronic_ISBN :
1550-5499
DOI :
10.1109/ICCV.2007.4408984