DocumentCode
443168
Title
Incorporating visual knowledge representation in stereo reconstruction
Author
Barbu, Adrian ; Zhu, Song-Chun
Author_Institution
Departments of Comput. Sci. & Stat., California Univ., Los Angeles, CA, USA
Volume
1
fYear
2005
fDate
17-21 Oct. 2005
Firstpage
572
Abstract
In this paper, we present a two-layer generative model that incorporates generic middle-level visual knowledge for dense stereo reconstruction. The visual knowledge is represented by a dictionary of surface primitives including various categories of boundary discontinuities and junctions in parametric form. Given a stereo pair, we first compute a primal sketch representation which decomposes the image into a structural part for object boundaries and high intensity contrast represented by a 2D sketch graph, and a structure less part represented by Markov random field on pixels. Then we label the sketch graph and compute the 3D sketch (like a wire-frame) by fitting the primitive dictionary to the sketch graph. The surfaces between the 3D sketches are filled in by computing the depth of the MRP model on the structureless part. These two levels interact closely since the MRF is used to propagate information between the primitives, and at the same time, the primitives act as boundary conditions for the MRF. The two processes maximize a Bayesian posterior probability jointly. We propose an MCMC algorithm that simultaneously infers the 3D primitive types and parameters and estimates the depth of the scene. Our experiments show that this representation can infer the depth map with sharp boundaries and junctions for textureless images, curve objects and free-form shapes.
Keywords
Bayes methods; Markov processes; image reconstruction; image texture; knowledge representation; stereo image processing; Bayesian posterior probability; MCMC algorithm; Markov random field; curve object; free-form shape; image decomposition; object boundary; primal sketch representation; sketch graph; stereo reconstruction; textureless image; two-layer generative model; visual knowledge representation; Boundary conditions; Dictionaries; Image reconstruction; Knowledge representation; Markov random fields; Materials requirements planning; Pixel; Stereo image processing; Surface fitting; Surface reconstruction;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on
ISSN
1550-5499
Print_ISBN
0-7695-2334-X
Type
conf
DOI
10.1109/ICCV.2005.120
Filename
1541305
Link To Document