Title :
Spatiotemporal segmentation and tracking of objects for visualization of videoconference image sequences
Author :
Kompatsiaris, Ioannis ; Gerassimos Strintz, M.
Author_Institution :
Inf. Processing Lab., Aristotelian Univ. of Thessaloniki, Greece
fDate :
12/1/2000 12:00:00 AM
Abstract :
A procedure is described for the segmentation, content-based coding, and visualization of videoconference image sequences. First, image sequence analysis is used to estimate the shape and motion parameters of the person facing the camera. A spatiotemporal filter, taking into account the intensity differences between consequent frames, is applied, in order to separate the moving person from the static background. The foreground is segmented in a number of regions in order to identify the face. For this purpose, we propose the novel procedure of K-means with connectivity constraint algorithm as a general segmentation algorithm combining several types of information including intensity, motion and compactness. In this algorithm, the use of spatiotemporal regions is introduced since a number of frames are analyzed simultaneously and as a result, the same region is present in consequent frames. Based on this information, a 3-D ellipsoid is adapted to the person´s face using an efficient and robust algorithm. The rigid 3-D motion is estimated next using a least median of squares approach, Finally, a virtual reality modeling language (VRML) file is created containing all the above information; this file may be viewed by using any VRML 2.0 compliant browser
Keywords :
data compression; filtering theory; image segmentation; image sequences; motion estimation; teleconferencing; tracking filters; video coding; virtual reality languages; 3D ellipsoid; 3D motion estimation; K-means algorithm; VRML 2.0 compliant browser; VRML file; camera; connectivity constraint algorithm; content-based coding; efficient algorithm; face identification; foreground; image sequence analysis; image visualization; intensity differences; motion parameter estimation; robust algorithm; segmentation algorithm; shape parameter estimation; spatiotemporal filter; spatiotemporal object segmentation; spatiotemporal object tracking; spatiotemporal regions; static background; video coding; videoconference image sequences; virtual reality modeling language; Image coding; Image segmentation; Image sequence analysis; Image sequences; Motion analysis; Motion estimation; Shape; Spatiotemporal phenomena; Videoconference; Visualization;
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on