DocumentCode :
484568
Title :
Generating High-Quality Training Data for Automated Land-Cover Mapping
Author :
Rebbapragada, U. ; Lomasky, R. ; Brodley, C.E. ; Friedl, M.A.
Author_Institution :
Dept. of Comput. Sci., Tufts Univ., Medford, MA
Volume :
4
fYear :
2008
fDate :
7-11 July 2008
Abstract :
This paper presents two machine learning techniques that greatly reduce the number of person-hours required to generate high-quality training data for land cover classification. The first technique uses active learning to guide the generation of training data by selecting only the most informative examples for labeling. The second technique identifies and mitigates the impact of mislabeled instances. Both techniques are tested on data from NASA´s Moderate Resolution Imaging Spectroradiometer (MODIS), which has required thousands of person hours to label. Our results shows that the active learning method requires fewer labeled examples than random sampling to produce a high quality classifier. Our results on class noise mitigation show that if mislabelings occur, we can further improve classifier accuracy, and that weighting instances by their label confidence outperforms an analogous method that discards suspected mislabelings. If combined, these methods have the potential to make training data generation a more efficient and reliable process.
Keywords :
geophysics computing; image classification; learning (artificial intelligence); terrain mapping; MODIS; Moderate Resolution Imaging Spectroradiometer; NASA; active learning method; analogous method; automated land-cover mapping; data generation; generate high-quality training data; labeling; land cover classification; machine learning techniques; mislabeled instances; Humans; Labeling; Learning systems; MODIS; Machine learning; Sampling methods; Support vector machine classification; Support vector machines; Testing; Training data; active learning; landcover classification; supervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Geoscience and Remote Sensing Symposium, 2008. IGARSS 2008. IEEE International
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4244-2807-6
Electronic_ISBN :
978-1-4244-2808-3
Type :
conf
DOI :
10.1109/IGARSS.2008.4779779
Filename :
4779779
Link To Document :
بازگشت