Title :
R-D optimized auxiliary information for inpainting-based view synthesis
Author :
Daribo, Ismael ; Cheung, Gene ; Maugey, Thomas ; Frossard, Pascal
Author_Institution :
Nat. Inst. of Inf. (NII), Tokyo, Japan
Abstract :
Texture and depth maps of two neighboring camera viewpoints are typically required for synthesis of an intermediate virtual view via depth-image-based rendering (DIBR). However, the bitrate overhead required for reconstruction of multiple texture and depth maps at decoder can be large. The performance of multiview video encoders such as MVC is limited by the simple fact that the chosen representation is inherently redundant: a texture or depth pixel visible from both camera viewpoints is represented twice. In this paper, we propose an alternative 3D scene representation without such redundancy, yet at decoder, one can still reconstruct texture and depth maps of two camera viewpoints for DIBR-based synthesis of intermediate views. In particular, we propose to first encode texture and depth videos of a single viewpoint, which are used to synthesize the uncoded viewpoint via DIBR at decoder. Then, we encode additional rate-distortion (RD) optimal auxiliary information (AI) to guide an inpainting-based hole-filling algorithm at decoder and complete the missing information due to disocclusion. For a missing pixel patch in the synthesized view, the AI can: i) be skipped and then let the decoder by itself retrieve the missing information, ii) identify a suitable spatial region in the reconstructed view for patch-matching, or iii) explicitly encode missing pixel patch if no satisfactory patch can be found in the reconstructed view. Experimental results show that our alternative representation can achieve up to 41% bit-savings compared to H.264/MVC implementation.
Keywords :
cameras; image representation; image texture; rendering (computer graphics); video coding; 3D scene representation; RD optimized auxiliary information; camera viewpoints; depth maps; depth-image-based rendering; image texture; inpainting-based hole-filling algorithm; inpainting-based view synthesis; multiview video encoders; patch matching; rate-distortion optimal auxiliary information; virtual view; Artificial intelligence; Bit rate; Cameras; Decoding; Image coding; Image reconstruction; Rendering (computer graphics); Texture-plus-depth format; compact representation; depth-image-based rendering;
Conference_Titel :
3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON), 2012
Conference_Location :
Zurich
Print_ISBN :
978-1-4673-4904-8
Electronic_ISBN :
2161-2021
DOI :
10.1109/3DTV.2012.6365437