• DocumentCode
    3013425
  • Title

    Tree-based Classifiers for Bilayer Video Segmentation

  • Author

    Yin, Pei ; Criminisi, Antonio ; Winn, John ; Essa, Irfan

  • Author_Institution
    Georgia Inst. of Technol., Atlanta
  • fYear
    2007
  • fDate
    17-22 June 2007
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    This paper presents an algorithm for the automatic segmentation of monocular videos into foreground and background layers. Correct segmentations are produced even in the presence of large background motion with nearly stationary foreground. There are three key contributions. The first is the introduction of a novel motion representation, "motons", inspired by research in object recognition. Second, we propose learning the segmentation likelihood from the spatial context of motion. The learning is efficiently performed by Random Forests. The third contribution is a general taxonomy of tree-based classifiers, which facilitates theoretical and experimental comparisons of several known classification algorithms, as well as spawning new ones. Diverse visual cues such as motion, motion context, colour, contrast and spatial priors are fused together by means of a conditional random field (CRF) model. Segmentation is then achieved by binary min-cut. Our algorithm requires no initialization. Experiments on many video-chat type sequences demonstrate the effectiveness of our algorithm in a variety of scenes. The segmentation results are comparable to those obtained by stereo systems.
  • Keywords
    image classification; image motion analysis; image representation; image segmentation; image sequences; object recognition; trees (mathematics); video signal processing; CRF model; background layer; bilayer monocular video segmentation; conditional random field; foreground layer; motion representation; object recognition; random forests; tree-based classifier; video-chat type sequences; Cameras; Classification tree analysis; Geometrical optics; Image motion analysis; Image segmentation; Labeling; Lighting; Optical computing; Robustness; Video compression;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on
  • Conference_Location
    Minneapolis, MN
  • ISSN
    1063-6919
  • Print_ISBN
    1-4244-1179-3
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2007.383008
  • Filename
    4270033