Title :
Automatic segmentation of moving objects in video sequences: a region labeling approach
Author :
Tsaig, Yaakov ; Averbuch, Amir
Author_Institution :
Dept. of Comput. Sci., Tel Aviv Univ., Israel
fDate :
7/1/2002 12:00:00 AM
Abstract :
The emerging video coding standard MPEG-4 enables various content-based functionalities for multimedia applications. To support such functionalities, as well as to improve coding efficiency, MPEG-4 relies on a decomposition of each frame of an image sequence into video object planes (VOP). Each VOP corresponds to a single moving object in the scene. This paper presents a new method for automatic segmentation of moving objects in image sequences for VOP extraction. We formulate the problem as graph labeling over a region adjacency graph (RAG), based on motion information. The label field is modeled as a Markov random field (MRF). An initial spatial partition of each frame is obtained by a fast, floating-point based implementation of the watershed algorithm. The motion of each region is estimated by hierarchical region matching. To avoid inaccuracies in occlusion areas, a novel motion validation scheme is presented. A dynamic memory, based on object tracking, is incorporated into the segmentation process to maintain temporal coherence of the segmentation. Finally, a labeling is obtained by maximization of the a posteriori probability of the MRF using motion information, spatial information and the memory. The optimization is carried out by highest confidence first (HCF). Experimental results for several video sequences demonstrate the effectiveness of the proposed approach
Keywords :
Markov processes; code standards; feature extraction; graph theory; image matching; image segmentation; image sequences; motion estimation; multimedia communication; optimisation; video coding; MPEG-4; Markov random field; VOP extraction; a posteriori probability; automatic segmentation; content-based multimedia; dynamic memory; floating-point watershed algorithm; frame decomposition; graph labeling; hierarchical region matching; highest confidence first optimization; image sequence; image sequences; maximization; motion estimation; object tracking; region adjacency graph; temporal coherence; video coding standard; video object planes; video segmentation; Data mining; Image coding; Image segmentation; Image sequences; Labeling; Layout; MPEG 4 Standard; Markov random fields; Video coding; Video sequences;
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
DOI :
10.1109/TCSVT.2002.800513