A multimodal approach for image de-fencing and depth inpainting

Author

Jonna, Sankaraganesh ; Voleti, Vikram S. ; Sahay, Rajiv R. ; Kankanhalli, Mohan S.

Author_Institution

Dept. of Electr. Eng., IIT Kharagpur, Kharagpur, India

fYear

2015

fDate

4-7 Jan. 2015

Firstpage

1

Lastpage

6

Abstract

Low cost RGB-D sensors such as the Microsoft Kinect have enabled the use of depth data along with color images. In this work, we propose a multi-modal approach to address the problem of removal of fences/occlusions from images captured using a Kinect camera. We also perform depth completion by fusing data from multiple recorded depth maps affected by occlusions. The availability of aligned image and depth data from Kinect aids us in the detection of the fence locations. However, accurate estimation of the relative shifts between the captured color frames is necessary. Initially, for the case of static scene elements with simple relative motion between the camera and the objects, we propose the use of affine scale-invariant feature transform descriptor (ASIFT) to compute the relative global displacements. We also address the scenario wherein the relative motion between the frames may not be global using the depth map obtained by Kinect. For such a scenario involving complex motion of scene pixels, we use a recently proposed robust optical flow technique. We show results for challenging real-world data wherein the scene is dynamic. The inverse ill-posed problems of estimation of the de-fenced image and the inpainted depth map are solved using an optimization-based framework. Specifically, we model the unoccluded image and the completed depth map as two distinct Markov random fields, respectively, and obtain their maximum a-posteriori estimates using loopy belief propagation.

Keywords

Markov processes; affine transforms; image capture; image colour analysis; image fusion; image motion analysis; image sensors; image sequences; maximum likelihood estimation; optimisation; random processes; ASIFT; Markov random fields; Microsoft Kinect camera; affine scale-invariant feature transform descriptor; color frames; color images; data fusion; depth data; depth inpainting; fence location detection; image data; image defencing; loopy belief propagation; low cost RGB-D sensors; maximum a-posteriori estimates; multimodal approach; optimization-based framework; recorded depth maps; robust optical flow technique; scene pixels; static scene elements; Cameras; Image color analysis; Image reconstruction; Optical imaging; Optical sensors; Videos; Belief propagation; Image de-fencing; Inpainting; Kinect; Markov random field; RGB-D data;

fLanguage

English

Publisher

ieee

Conference_Titel

Advances in Pattern Recognition (ICAPR), 2015 Eighth International Conference on

Conference_Location

Kolkata

Type

conf

DOI

10.1109/ICAPR.2015.7050696

Filename

7050696