مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning 3-D Scene Structure from a Single Still Image

DocumentCode :

2457316

Title :

Learning 3-D Scene Structure from a Single Still Image

Author :

Saxena, Ashutosh ; Sun, Min ; Ng, Andrew Y.

Author_Institution :

Stanford Univ., Stanford

fYear :

2007

fDate :

14-21 Oct. 2007

Firstpage :

Lastpage :

Abstract :

We consider the problem of estimating detailed 3D structure from a single still image of an unstructured environment. Our goal is to create 3D models which are both quantitatively accurate as well as visually pleasing. For each small homogeneous patch in the image, we use a Markov random field (MRF) to infer a set of "plane parameters" that capture both the 3D location and 3D orientation of the patch. The MRF, trained via supervised learning, models both image depth cues as well as the relationships between different parts of the image. Inference in our model is tractable, and requires only solving a convex optimization problem. Other than assuming that the environment is made up of a number of small planes, our model makes no explicit assumptions about the structure of the scene; this enables the algorithm to capture much more detailed 3D structure than does prior art (such as Saxena et ah, 2005, Delage et ah, 2005, and Hoiem et el, 2005), and also give a much richer experience in the 3D flythroughs created using image-based rendering, even for scenes with significant non-vertical structure. Using this approach, we have created qualitatively correct 3D models for 64.9% of 588 images downloaded from the Internet, as compared to Hoiem et al.\´s performance of 33.1%. Further, our models are quantitatively more accurate than either Saxena et al. or Hoiem et al.

Keywords :

Markov processes; convex programming; image processing; learning (artificial intelligence); random processes; rendering (computer graphics); 3D scene structure; Markov random field; convex optimization problem; image-based rendering; single still image; supervised learning; Art; Computer science; Humans; Inference algorithms; Internet; Layout; Markov random fields; Rendering (computer graphics); Sun; Supervised learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on

Conference_Location :

Rio de Janeiro

ISSN :

1550-5499

Print_ISBN :

978-1-4244-1630-1

Electronic_ISBN :

1550-5499

Type :

conf

DOI :

10.1109/ICCV.2007.4408828

Filename :

4408828

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2457316