DocumentCode :
1442985
Title :
Bilayer Segmentation of Webcam Videos Using Tree-Based Classifiers
Author :
Yin, Pei ; Criminisi, Antonio ; Winn, John ; Essa, Irfan
Author_Institution :
Microsoft Corp., Redmond, WA, USA
Volume :
33
Issue :
1
fYear :
2011
Firstpage :
30
Lastpage :
42
Abstract :
This paper presents an automatic segmentation algorithm for video frames captured by a (monocular) webcam that closely approximates depth segmentation from a stereo camera. The frames are segmented into foreground and background layers that comprise a subject (participant) and other objects and individuals. The algorithm produces correct segmentations even in the presence of large background motion with a nearly stationary foreground. This research makes three key contributions: First, we introduce a novel motion representation, referred to as “motons,” inspired by research in object recognition. Second, we propose estimating the segmentation likelihood from the spatial context of motion. The estimation is efficiently learned by random forests. Third, we introduce a general taxonomy of tree-based classifiers that facilitates both theoretical and experimental comparisons of several known classification algorithms and generates new ones. In our bilayer segmentation algorithm, diverse visual cues such as motion, motion context, color, contrast, and spatial priors are fused by means of a conditional random field (CRF) model. Segmentation is then achieved by binary min-cut. Experiments on many sequences of our videochat application demonstrate that our algorithm, which requires no initialization, is effective in a variety of scenes, and the segmentation results are comparable to those obtained by stereo systems.
Keywords :
computer vision; decision trees; image classification; image motion analysis; image segmentation; learning (artificial intelligence); maximum likelihood estimation; object recognition; video communication; video signal processing; bilayer segmentation; computer vision; conditional random field; decision tree; image motion analysis; image understanding; likelihood estimation; machine learning; monocular Webcam video; object recognition; stereo camera; tree based classifier; Cameras; Classification algorithms; Classification tree analysis; Image segmentation; Lighting; Motion estimation; Object recognition; Robustness; Taxonomy; Videos; Computer vision; boosting; decision tree; image understanding; machine learning; motion analysis.; random forests; Algorithms; Artificial Intelligence; Computer Simulation; Humans; Image Processing, Computer-Assisted; Motion; Pattern Recognition, Automated; Video Recording;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2010.65
Filename :
5432210
Link To Document :
بازگشت