• DocumentCode
    253576
  • Title

    The Shape-Time Random Field for Semantic Video Labeling

  • Author

    Kae, Andrew ; Marlin, Benjamin ; Learned-Miller, Erik

  • Author_Institution
    Sch. of Comput. Sci., Univ. of Massachusetts, Amherst, MA, USA
  • fYear
    2014
  • fDate
    23-28 June 2014
  • Firstpage
    272
  • Lastpage
    279
  • Abstract
    We propose a novel discriminative model for semantic labeling in videos by incorporating a prior to model both the shape and temporal dependencies of an object in video. A typical approach for this task is the conditional random field (CRF), which can model local interactions among adjacent regions in a video frame. Recent work has shown how to incorporate a shape prior into a CRF for improving labeling performance, but it may be difficult to model temporal dependencies present in video by using this prior. The conditional restricted Boltzmann machine (CRBM) can model both shape and temporal dependencies, and has been used to learn walking styles from motion- capture data. In this work, we incorporate a CRBM prior into a CRF framework and present a new state-of-the-art model for the task of semantic labeling in videos. In particular, we explore the task of labeling parts of complex face scenes from videos in the YouTube Faces Database (YFDB). Our combined model outperforms competitive baselines both qualitatively and quantitatively.
  • Keywords
    Boltzmann machines; face recognition; natural scenes; random processes; video signal processing; visual databases; CRBM; CRF framework; YFDB; YouTube face database; complex face scenes; conditional random field; conditional restricted Boltzmann machine; discriminative model; labeling performance improvement; local interactions; semantic video labeling; shape dependencies; shape-time random field; temporal dependencies; video frame; Computational modeling; Face; History; Labeling; Mathematical model; Semantics; Shape; CRF; RBM; deep learning; deep model; faces; image labeling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
  • Conference_Location
    Columbus, OH
  • Type

    conf

  • DOI
    10.1109/CVPR.2014.42
  • Filename
    6909436