DocumentCode
2209224
Title
Active Learning with Human-Like Noisy Oracle
Author
Du, Jun ; Ling, Charles X.
Author_Institution
Dept. of Comput. Sci., Univ. of Western Ontario, London, ON, Canada
fYear
2010
fDate
13-17 Dec. 2010
Firstpage
797
Lastpage
802
Abstract
When active learning is applied to real-world applications, human experts usually act as oracles to provide labels. However, human make mistakes, thus noise might be introduced during the learning process. Most previous studies simplify the problem by assuming uniformly-distributed noise over the sample space. Such assumption, however, might fail to precisely reflect the human experts´ behaviour in real-world situations. In this paper, we therefore study active learning with such human-like oracles, by making a more realistic assumption that the noise is example-dependent (i.e., non-uniformly distributed over the sample space). More specifically, when the human-like oracle is highly confident in labelling examples, it is naturally less likely to provide incorrect answers, whereas when such confidence is low, the noise would be more likely to be introduced. Based on the analysis of such human-like oracle, we propose a generic yet simple active learning algorithm to simultaneously explore the unlabelled data and exploit the labelled data. Empirical study on both synthetic and real-world data sets verifies the superiority of the proposed algorithm, compared with the traditional uncertainty sampling.
Keywords
data mining; learning (artificial intelligence); sampling methods; active learning; human experts; labelled data; noise; oracles; uncertainty sampling; unlabelled data; active learning; noise; oracle;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location
Sydney, NSW
ISSN
1550-4786
Print_ISBN
978-1-4244-9131-5
Electronic_ISBN
1550-4786
Type
conf
DOI
10.1109/ICDM.2010.114
Filename
5694041
Link To Document