• DocumentCode
    1864746
  • Title

    A multimodal approach for image de-fencing and depth inpainting

  • Author

    Jonna, Sankaraganesh ; Voleti, Vikram S. ; Sahay, Rajiv R. ; Kankanhalli, Mohan S.

  • Author_Institution
    Dept. of Electr. Eng., IIT Kharagpur, Kharagpur, India
  • fYear
    2015
  • fDate
    4-7 Jan. 2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Low cost RGB-D sensors such as the Microsoft Kinect have enabled the use of depth data along with color images. In this work, we propose a multi-modal approach to address the problem of removal of fences/occlusions from images captured using a Kinect camera. We also perform depth completion by fusing data from multiple recorded depth maps affected by occlusions. The availability of aligned image and depth data from Kinect aids us in the detection of the fence locations. However, accurate estimation of the relative shifts between the captured color frames is necessary. Initially, for the case of static scene elements with simple relative motion between the camera and the objects, we propose the use of affine scale-invariant feature transform descriptor (ASIFT) to compute the relative global displacements. We also address the scenario wherein the relative motion between the frames may not be global using the depth map obtained by Kinect. For such a scenario involving complex motion of scene pixels, we use a recently proposed robust optical flow technique. We show results for challenging real-world data wherein the scene is dynamic. The inverse ill-posed problems of estimation of the de-fenced image and the inpainted depth map are solved using an optimization-based framework. Specifically, we model the unoccluded image and the completed depth map as two distinct Markov random fields, respectively, and obtain their maximum a-posteriori estimates using loopy belief propagation.
  • Keywords
    Markov processes; affine transforms; image capture; image colour analysis; image fusion; image motion analysis; image sensors; image sequences; maximum likelihood estimation; optimisation; random processes; ASIFT; Markov random fields; Microsoft Kinect camera; affine scale-invariant feature transform descriptor; color frames; color images; data fusion; depth data; depth inpainting; fence location detection; image data; image defencing; loopy belief propagation; low cost RGB-D sensors; maximum a-posteriori estimates; multimodal approach; optimization-based framework; recorded depth maps; robust optical flow technique; scene pixels; static scene elements; Cameras; Image color analysis; Image reconstruction; Optical imaging; Optical sensors; Videos; Belief propagation; Image de-fencing; Inpainting; Kinect; Markov random field; RGB-D data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Pattern Recognition (ICAPR), 2015 Eighth International Conference on
  • Conference_Location
    Kolkata
  • Type

    conf

  • DOI
    10.1109/ICAPR.2015.7050696
  • Filename
    7050696