• DocumentCode
    2912955
  • Title

    Scene shape from texture of objects

  • Author

    Payet, Nadia ; Todorovic, Sinisa

  • Author_Institution
    Oregon State Univ., Corvallis, OR, USA
  • fYear
    2011
  • fDate
    20-25 June 2011
  • Firstpage
    2017
  • Lastpage
    2024
  • Abstract
    Joint reasoning about objects and 3D scene layout has shown great promise in scene interpretation. One visual cue that has been overlooked is texture arising from a spatial repetition of objects in the scene (e.g., windows of a building). Such texture provides scene-specific constraints among objects, and thus facilitates scene interpretation. We present an approach to: (1) detecting distinct textures of objects in a scene, (2) reconstructing the 3D shape of detected texture surfaces, and (3) combining object detections and shape-from-texture toward a globally consistent scene interpretation. Inference is formulated within the reinforcement learning framework as a sequential interpretation of image regions, starting from confident regions to guide the interpretation of other regions. Our algorithm finds an optimal policy that maps states of detected objects and reconstructed surfaces to actions which ought to be taken in those states, including detecting new objects and identifying new textures, so as to minimize a long-term loss. Tests against ground truth obtained from stereo images demonstrate that we can coarsely reconstruct a 3D model of the scene from a single image, without learning the layout of common scene surfaces, as done in prior work. We also show that reasoning about texture of objects improves object detection.
  • Keywords
    image reconstruction; image texture; inference mechanisms; object detection; stereo image processing; 3D scene layout; 3D shape reconstruction; joint reasoning; object detection; object texture; scene interpretation; scene shape; shape from texture; stereo images; Buildings; Detectors; Image reconstruction; Object detection; Surface reconstruction; Surface texture; Three dimensional displays;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
  • Conference_Location
    Providence, RI
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4577-0394-2
  • Type

    conf

  • DOI
    10.1109/CVPR.2011.5995326
  • Filename
    5995326