DocumentCode :
996238
Title :
Real-time object segmentation and coding for selective-quality video communications
Author :
Challapali, Kiran ; Brodsky, Tomas ; Lin, Yun-Ting ; Yan, Yong ; Chen, Richard Yi
Author_Institution :
Philips Res., Briarcliff Manor, NY, USA
Volume :
14
Issue :
6
fYear :
2004
fDate :
6/1/2004 12:00:00 AM
Firstpage :
813
Lastpage :
824
Abstract :
The MPEG-4 standard enables the representation of video as a collection of objects. This paper describes an automatic system that exploits such a representation. Our system consists of two parts: real-time content extraction algorithms and a real-time multi-object rate control method. We present two approaches to content extraction: foreground segmentation based on two cameras and face segmentation based on a single camera. The main contributions of this paper are: 1) under a stereo camera setup, we improve a disparity estimation algorithm to obtain crisp and smooth boundaries of foreground objects; 2) for a single camera scenario, we propose a novel algorithm for face detection and tracking, combining facial color and structure information; and 3) we develop a constant-quality variable bitrate (CQ-VBR) control algorithm that guarantees the quality specification for each object obtained from the two content extraction methods. Both segmentation algorithms run in real-time on a low-cost media processor, and have been tested extensively in various indoor environments. The CQ-VBR control algorithm is a useful tool for the evaluation of object-based coding. For low-bit-rate applications, we can achieve significant reduction in the overall bitrate, while maintaining the same visual quality of the foreground/face object as compared to conventional frame-based coding. Based on tests conducted on several sequences of different complexity levels, the bit-rate savings can be up to 48%. The satisfactory foreground segmentation (results presented) permits porting a live foreground object into arbitrary scenes to create composite video.
Keywords :
feature extraction; image colour analysis; image segmentation; object detection; real-time systems; stereo image processing; video coding; visual communication; MPEG-4 standard; constant quality variable bit rate control; content-extraction method; disparity estimation algorithm; face detection; face segmentation; face tracking; facial color information; facial structure information; foreground segmentation; indoor environment; media processor; object coding; object-based coding; real-time content extraction; real-time multiobject rate control method; real-time object segmentation; selective-quality video communication; stereo camera setup; video data segmentation; Automatic control; Bit rate; Cameras; Communication system control; Control systems; Face detection; MPEG 4 Standard; Object segmentation; Real time systems; Testing; MPEG-4; multi-object coding; object segmentation; rate control; real-time content extraction;
fLanguage :
English
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher :
ieee
ISSN :
1051-8215
Type :
jour
DOI :
10.1109/TCSVT.2004.828337
Filename :
1302162
Link To Document :
بازگشت