• DocumentCode
    1442985
  • Title

    Bilayer Segmentation of Webcam Videos Using Tree-Based Classifiers

  • Author

    Yin, Pei ; Criminisi, Antonio ; Winn, John ; Essa, Irfan

  • Author_Institution
    Microsoft Corp., Redmond, WA, USA
  • Volume
    33
  • Issue
    1
  • fYear
    2011
  • Firstpage
    30
  • Lastpage
    42
  • Abstract
    This paper presents an automatic segmentation algorithm for video frames captured by a (monocular) webcam that closely approximates depth segmentation from a stereo camera. The frames are segmented into foreground and background layers that comprise a subject (participant) and other objects and individuals. The algorithm produces correct segmentations even in the presence of large background motion with a nearly stationary foreground. This research makes three key contributions: First, we introduce a novel motion representation, referred to as “motons,” inspired by research in object recognition. Second, we propose estimating the segmentation likelihood from the spatial context of motion. The estimation is efficiently learned by random forests. Third, we introduce a general taxonomy of tree-based classifiers that facilitates both theoretical and experimental comparisons of several known classification algorithms and generates new ones. In our bilayer segmentation algorithm, diverse visual cues such as motion, motion context, color, contrast, and spatial priors are fused by means of a conditional random field (CRF) model. Segmentation is then achieved by binary min-cut. Experiments on many sequences of our videochat application demonstrate that our algorithm, which requires no initialization, is effective in a variety of scenes, and the segmentation results are comparable to those obtained by stereo systems.
  • Keywords
    computer vision; decision trees; image classification; image motion analysis; image segmentation; learning (artificial intelligence); maximum likelihood estimation; object recognition; video communication; video signal processing; bilayer segmentation; computer vision; conditional random field; decision tree; image motion analysis; image understanding; likelihood estimation; machine learning; monocular Webcam video; object recognition; stereo camera; tree based classifier; Cameras; Classification algorithms; Classification tree analysis; Image segmentation; Lighting; Motion estimation; Object recognition; Robustness; Taxonomy; Videos; Computer vision; boosting; decision tree; image understanding; machine learning; motion analysis.; random forests; Algorithms; Artificial Intelligence; Computer Simulation; Humans; Image Processing, Computer-Assisted; Motion; Pattern Recognition, Automated; Video Recording;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2010.65
  • Filename
    5432210