• DocumentCode
    3333605
  • Title

    Discriminative Segment Annotation in Weakly Labeled Video

  • Author

    Tang, Ke ; Sukthankar, Rahul ; Yagnik, Jay ; Li Fei-Fei

  • fYear
    2013
  • fDate
    23-28 June 2013
  • Firstpage
    2483
  • Lastpage
    2490
  • Abstract
    The ubiquitous availability of Internet video offers the vision community the exciting opportunity to directly learn localized visual concepts from real-world imagery. Unfortunately, most such attempts are doomed because traditional approaches are ill-suited, both in terms of their computational characteristics and their inability to robustly contend with the label noise that plagues uncurated Internet content. We present CRANE, a weakly supervised algorithm that is specifically designed to learn under such conditions. First, we exploit the asymmetric availability of real-world training data, where small numbers of positive videos tagged with the concept are supplemented with large quantities of unreliable negative data. Second, we ensure that CRANE is robust to label noise, both in terms of tagged videos that fail to contain the concept as well as occasional negative videos that do. Finally, CRANE is highly parallelizable, making it practical to deploy at large scale without sacrificing the quality of the learned solution. Although CRANE is general, this paper focuses on segment annotation, where we show state-of-the-art pixel-level segmentation results on two datasets, one of which includes a training set of spatiotemporal segments from more than 20,000 videos.
  • Keywords
    Internet; image segmentation; learning (artificial intelligence); video retrieval; CRANE; Internet video; discriminative segment annotation; label noise; localized visual concepts; pixel-level segmentation; real-world training data; spatiotemporal segments; ubiquitous availability; vision community; weakly labeled video; weakly supervised algorithm; Cranes; Image segmentation; Lead; Noise; Spatiotemporal phenomena; Standards; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on
  • Conference_Location
    Portland, OR
  • ISSN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2013.321
  • Filename
    6619165