مرکز منطقه ای اطلاع رساني علوم و فناوري - Spatiotemporal segmentation and tracking of objects for visualization of videoconference image sequences

DocumentCode :

1416171

Title :

Spatiotemporal segmentation and tracking of objects for visualization of videoconference image sequences

Author :

Kompatsiaris, Ioannis ; Gerassimos Strintz, M.

Author_Institution :

Inf. Processing Lab., Aristotelian Univ. of Thessaloniki, Greece

Volume :

Issue :

fYear :

2000

fDate :

12/1/2000 12:00:00 AM

Firstpage :

1388

Lastpage :

1402

Abstract :

A procedure is described for the segmentation, content-based coding, and visualization of videoconference image sequences. First, image sequence analysis is used to estimate the shape and motion parameters of the person facing the camera. A spatiotemporal filter, taking into account the intensity differences between consequent frames, is applied, in order to separate the moving person from the static background. The foreground is segmented in a number of regions in order to identify the face. For this purpose, we propose the novel procedure of K-means with connectivity constraint algorithm as a general segmentation algorithm combining several types of information including intensity, motion and compactness. In this algorithm, the use of spatiotemporal regions is introduced since a number of frames are analyzed simultaneously and as a result, the same region is present in consequent frames. Based on this information, a 3-D ellipsoid is adapted to the person´s face using an efficient and robust algorithm. The rigid 3-D motion is estimated next using a least median of squares approach, Finally, a virtual reality modeling language (VRML) file is created containing all the above information; this file may be viewed by using any VRML 2.0 compliant browser

Keywords :

data compression; filtering theory; image segmentation; image sequences; motion estimation; teleconferencing; tracking filters; video coding; virtual reality languages; 3D ellipsoid; 3D motion estimation; K-means algorithm; VRML 2.0 compliant browser; VRML file; camera; connectivity constraint algorithm; content-based coding; efficient algorithm; face identification; foreground; image sequence analysis; image visualization; intensity differences; motion parameter estimation; robust algorithm; segmentation algorithm; shape parameter estimation; spatiotemporal filter; spatiotemporal object segmentation; spatiotemporal object tracking; spatiotemporal regions; static background; video coding; videoconference image sequences; virtual reality modeling language; Image coding; Image segmentation; Image sequence analysis; Image sequences; Motion analysis; Motion estimation; Shape; Spatiotemporal phenomena; Videoconference; Visualization;

fLanguage :

English

Journal_Title :

Circuits and Systems for Video Technology, IEEE Transactions on

Publisher :

ieee

ISSN :

1051-8215

Type :

jour

DOI :

10.1109/76.889030

Filename :

889030

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1416171