• DocumentCode
    2209224
  • Title

    Active Learning with Human-Like Noisy Oracle

  • Author

    Du, Jun ; Ling, Charles X.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Western Ontario, London, ON, Canada
  • fYear
    2010
  • fDate
    13-17 Dec. 2010
  • Firstpage
    797
  • Lastpage
    802
  • Abstract
    When active learning is applied to real-world applications, human experts usually act as oracles to provide labels. However, human make mistakes, thus noise might be introduced during the learning process. Most previous studies simplify the problem by assuming uniformly-distributed noise over the sample space. Such assumption, however, might fail to precisely reflect the human experts´ behaviour in real-world situations. In this paper, we therefore study active learning with such human-like oracles, by making a more realistic assumption that the noise is example-dependent (i.e., non-uniformly distributed over the sample space). More specifically, when the human-like oracle is highly confident in labelling examples, it is naturally less likely to provide incorrect answers, whereas when such confidence is low, the noise would be more likely to be introduced. Based on the analysis of such human-like oracle, we propose a generic yet simple active learning algorithm to simultaneously explore the unlabelled data and exploit the labelled data. Empirical study on both synthetic and real-world data sets verifies the superiority of the proposed algorithm, compared with the traditional uncertainty sampling.
  • Keywords
    data mining; learning (artificial intelligence); sampling methods; active learning; human experts; labelled data; noise; oracles; uncertainty sampling; unlabelled data; active learning; noise; oracle;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2010 IEEE 10th International Conference on
  • Conference_Location
    Sydney, NSW
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4244-9131-5
  • Electronic_ISBN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2010.114
  • Filename
    5694041