DocumentCode
2983572
Title
Active Label Correction
Author
Rebbapragada, Umaa ; Brodley, Carla E. ; Sulla-Menashe, Damien ; Friedl, M.A.
Author_Institution
Jet Propulsion Lab., California Inst. of Technol., Pasadena, CA, USA
fYear
2012
fDate
10-13 Dec. 2012
Firstpage
1080
Lastpage
1085
Abstract
Active Label Correction (ALC) is an interactive method that cleans an established training set of mislabeled examples in conjunction with a domain expert. ALC presumes that the expert who conducts this review is either more accurate than the original annotator or has access to additional resources that ensure a high quality label. A high-cost re-review is possible because ALC proceeds iteratively, scoring the full training set but selecting only small batches of examples that are likely mislabeled. The expert reviews each batch and corrects any mislabeled examples, after which the classifier is retrained and the process repeats until the expert terminates it. We compare several instantiations of ALC to fully-automated methods that attempt to discard or correct label noise in a single pass. Our empirical results show that ALC outperforms single-pass methods in terms of selection efficiency and classifier accuracy. We evaluate the best ALC instantiation on our motivating task of detecting mislabeled and poorly formulated sites within a land cover classification training set from the geography domain.
Keywords
geography; learning (artificial intelligence); pattern classification; terrain mapping; ALC method; active label correction; classifier accuracy; classifier retraining; domain expert; geography domain; land cover classification training; selection efficiency; single-pass method; Accuracy; Labeling; Noise; Noise level; Training; Training data; Uncertainty; data cleaning; label noise; land cover classification; supervised learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location
Brussels
ISSN
1550-4786
Print_ISBN
978-1-4673-4649-8
Type
conf
DOI
10.1109/ICDM.2012.162
Filename
6413805
Link To Document