• DocumentCode
    2964476
  • Title

    Automatic Concept Detector Refinement for Large-Scale Video Semantic Annotation

  • Author

    Liu, Xueliang ; Huet, Benoit

  • Author_Institution
    EURECOM Inst., Sophia-Antipolis, France
  • fYear
    2010
  • fDate
    22-24 Sept. 2010
  • Firstpage
    97
  • Lastpage
    100
  • Abstract
    With the explosion of content sharing web site, an unprecedented amount of multimedia items are made available online on a day to day basis. Since search engine technologies rely essentially on textual information there is an urgent need to infer relevant semantic description through content based analysis on those multimedia documents. In this paper, we propose an approach which leverages the sheer volume of data available online to refine semantic concept detectors for videos annotation without requiring any additional human interaction. To address the problem in a realistic setting, we have collected a large video collection of about 42 thousand videos crawled from YouTube. A number of low-level features are extracted from those videos and are included within the corpus. Upon training on a small initial set of labeled video shots, the concept detectors are run on the large scale unlabeled corpus in order to identify and select new training samples. Thanks to this inexpensively obtained set of new training examples the concept detectors can be reinforced and enhanced based on a wider number of unlabeled samples and therefore better adapt to the corpus at hand. The experimental results reported here show that indeed the annotation accuracy improves when the training set is extended with automatically labeled samples.
  • Keywords
    Web sites; content-based retrieval; multimedia systems; text analysis; automatic concept detector refinement; content based analysis; content sharing Website; large-scale video semantic annotation; multimedia document; online video annotation; search engine; textual information; training set; Accuracy; Detectors; Feature extraction; Multimedia communication; Semantics; Streaming media; Training; Annotation; Dataset; Online Video;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Computing (ICSC), 2010 IEEE Fourth International Conference on
  • Conference_Location
    Pittsburgh, PA
  • Print_ISBN
    978-1-4244-7912-2
  • Electronic_ISBN
    978-0-7695-4154-9
  • Type

    conf

  • DOI
    10.1109/ICSC.2010.15
  • Filename
    5628899