• DocumentCode
    1490163
  • Title

    It´s All About the Data

  • Author

    Berg, Tamara L. ; Sorokin, Alexander ; Wang, Gang ; Forsyth, David Alexander ; Hoiem, Derek ; Endres, Ian ; Farhadi, Ali

  • Author_Institution
    Dept. of Comput. Sci., State Univ. of New York Stony Brook, Stony Brook, NY, USA
  • Volume
    98
  • Issue
    8
  • fYear
    2010
  • Firstpage
    1434
  • Lastpage
    1452
  • Abstract
    Modern computer vision research consumes labelled data in quantity, and building datasets has become an important activity. The Internet has become a tremendous resource for computer vision researchers. By seeing the Internet as a vast, slightly disorganized collection of visual data, we can build datasets. The key point is that visual data are surrounded by contextual information like text and HTML tags, which is a strong, if noisy, cue to what the visual data means. In a series of case studies, we illustrate how useful this contextual information is. It can be used to build a large and challenging labelled face dataset with no manual intervention. With very small amounts of manual labor, contextual data can be used together with image data to identify pictures of animals. In fact, these contextual data are sufficiently reliable that a very large pool of noisily tagged images can be used as a resource to build image features, which reliably improve on conventional visual features. By seeing the Internet as a marketplace that can connect sellers of annotation services to researchers, we can obtain accurately annotated datasets quickly and cheaply. We describe methods to prepare data, check quality, and set prices for work for this annotation process. The problems posed by attempting to collect very big research datasets are fertile for researchers because collecting datasets requires us to focus on two important questions: What makes a good picture? What is the meaning of a picture?
  • Keywords
    Internet; computer vision; data analysis; human computer interaction; image denoising; HTML tags; Internet; annotation services; computer vision research; contextual data; image features; labelled data; noisily tagged images; visual data; Animals; Application software; Computer science; Computer vision; Facebook; HTML; Image sampling; Search engines; Training data; Web and internet services; Web pages; Computer vision; Internet;
  • fLanguage
    English
  • Journal_Title
    Proceedings of the IEEE
  • Publisher
    ieee
  • ISSN
    0018-9219
  • Type

    jour

  • DOI
    10.1109/JPROC.2009.2032355
  • Filename
    5464301