Regional Information Center for Science and Technology - Video representation with three-dimensional entities

DocumentCode :

1253672

Title :

Video representation with three-dimensional entities

Author :

Martins, Fernando C M ; Moura, Jose M F

Author_Institution :

Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA

Volume :

Issue :

fYear :

1998

fDate :

1/1/1998

Firstpage :

Lastpage :

Abstract :

Very low bit-rate coding requires new paradigms that go well beyond pixel- and frame-based video representations. We introduce a novel content-based video representation using tridimensional entities: textured object models and pose estimates. The multiproperty object models carry stochastic information about the shape and texture of each object present in the scene. The pose estimates define the position and orientation of the objects for each frame. This representation is compact, and it provides alternative means for handling video by manipulating and compositing three-dimensional (3-D) entities. We call this representation tridimensional video compositing, or 3DVC for short. We present the 3DVC framework and describe the methods used to incrementally construct the object models and the pose estimates from unregistered noisy depth and texture measurements. We also describe a method for video frame reconstruction based on 3-D scene assembly, and discuss potential applications of 3DVC to video coding and content-based handling. 3DVC assumes that the objects in the scene are rigid and segmented. By assuming segmentation, we do not address the difficult questions of nonrigid segmentation and multiple object segmentation. In our experiments, segmentation is obtained via depth thresholding. It is important to note that 3DVC is independent of the segmentation technique adopted. Experimental results with synthetic and real video sequences, in which compression ratios in the range of 1:150 to 1:2700 are achieved, demonstrate the applicability of the proposed representation to very low bit-rate coding.

Keywords :

channel capacity; data compression; image reconstruction; image representation; image segmentation; image sequences; image texture; parameter estimation; video coding; 3D scene assembly; 3DVC; compression ratios; content-based video representation; depth thresholding; experimental results; frame-based video representations; object shape; orientation; pose estimates; position; segmentation; stochastic information; texture measurement; textured object models; three-dimensional entities; tridimensional video compositing; unregistered noisy depth measurement; very low bit-rate coding; video frame reconstruction; video sequences; Image reconstruction; Layout; Object oriented modeling; Pulse modulation; Shape; Stochastic processes; TV; Video coding; Video compression; Video sequences

fLanguage :

English

Journal_Title :

Selected Areas in Communications, IEEE Journal on

Publisher :

IEEE

ISSN :

0733-8716

Type :

jour

DOI :

10.1109/49.650921

Filename :

650921

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1253672