• DocumentCode
    140901
  • Title

    Crowd-powered find algorithms

  • Author

    Das Sarma, Akash ; Parameswaran, Aditya ; Garcia-Molina, Hector ; Halevy, A.

  • Author_Institution
    ClearList Inc., USA
  • fYear
    2014
  • fDate
    March 31 2014-April 4 2014
  • Firstpage
    964
  • Lastpage
    975
  • Abstract
    We consider the problem of using humans to find a bounded number of items satisfying certain properties, from a data set. For instance, we may want humans to identify a select number of travel photos from a data set of photos to display on a travel website, or a candidate set of resumes that meet certain requirements from a large pool of applicants. Since data sets can be enormous, and since monetary cost and latency of data processing with humans can be large, optimizing the use of humans for finding items is an important challenge. We formally define the problem using the metrics of cost and time, and design optimal algorithms that span the skyline of cost and time, i.e., we provide designers the ability to control the cost vs. time trade-off. We study the deterministic as well as error-prone human answer settings, along with multiplicative and additive approximations. Lastly, we study how we may design algorithms with specific expected cost and time measures.
  • Keywords
    Web sites; data handling; social sciences computing; additive approximations; cost control; crowd-powered find algorithms; data processing; data set; design algorithms; deterministic human answer settings; error-prone human answer settings; monetary cost; multiplicative approximations; optimal algorithms; time measures; time trade-off; travel Website; travel photos; Algorithm design and analysis; Approximation algorithms; Approximation methods; Databases; Machine learning algorithms; Measurement; Poles and towers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2014 IEEE 30th International Conference on
  • Conference_Location
    Chicago, IL
  • Type

    conf

  • DOI
    10.1109/ICDE.2014.6816715
  • Filename
    6816715