Automatic segmentation of moving objects in video sequences: a region labeling approach

Author

Tsaig, Yaakov ; Averbuch, Amir

Author_Institution

Dept. of Comput. Sci., Tel Aviv Univ., Israel

Volume

12

Issue

7

fYear

2002

fDate

7/1/2002 12:00:00 AM

Firstpage

597

Lastpage

612

Abstract

The emerging video coding standard MPEG-4 enables various content-based functionalities for multimedia applications. To support such functionalities, as well as to improve coding efficiency, MPEG-4 relies on a decomposition of each frame of an image sequence into video object planes (VOP). Each VOP corresponds to a single moving object in the scene. This paper presents a new method for automatic segmentation of moving objects in image sequences for VOP extraction. We formulate the problem as graph labeling over a region adjacency graph (RAG), based on motion information. The label field is modeled as a Markov random field (MRF). An initial spatial partition of each frame is obtained by a fast, floating-point based implementation of the watershed algorithm. The motion of each region is estimated by hierarchical region matching. To avoid inaccuracies in occlusion areas, a novel motion validation scheme is presented. A dynamic memory, based on object tracking, is incorporated into the segmentation process to maintain temporal coherence of the segmentation. Finally, a labeling is obtained by maximization of the a posteriori probability of the MRF using motion information, spatial information and the memory. The optimization is carried out by highest confidence first (HCF). Experimental results for several video sequences demonstrate the effectiveness of the proposed approach

Keywords

Markov processes; code standards; feature extraction; graph theory; image matching; image segmentation; image sequences; motion estimation; multimedia communication; optimisation; video coding; MPEG-4; Markov random field; VOP extraction; a posteriori probability; automatic segmentation; content-based multimedia; dynamic memory; floating-point watershed algorithm; frame decomposition; graph labeling; hierarchical region matching; highest confidence first optimization; image sequence; image sequences; maximization; motion estimation; object tracking; region adjacency graph; temporal coherence; video coding standard; video object planes; video segmentation; Data mining; Image coding; Image segmentation; Image sequences; Labeling; Layout; MPEG 4 Standard; Markov random fields; Video coding; Video sequences;

fLanguage

English

Journal_Title

Circuits and Systems for Video Technology, IEEE Transactions on

Publisher

ieee

ISSN

1051-8215

Type

jour

DOI

10.1109/TCSVT.2002.800513

Filename

1015672