DocumentCode :
249607
Title :
Using ensemble margin to explore issues of training data imbalance and mislabeling on large area land cover classification
Author :
Mellor, Andrew ; Boukir, Samia ; Haywood, Andrew ; Jones, Simon
Author_Institution :
Sch. of Math. & Geospatial Sci., RMIT Univ., Melbourne, VIC, Australia
fYear :
2014
fDate :
27-30 Oct. 2014
Firstpage :
5067
Lastpage :
5071
Abstract :
This work introduces new ensemble margin criteria, to evaluate the performance of Random Forests (RF), in the context of large area land cover classification, using imbalanced and noisy training data. Experiments using binary and multiclass classification problems reveal insights into the behaviour of RF over big data, in which training data contains noise and may not be evenly distributed among classes. The margin-based RF performance evaluation is conducted using remote sensing and ancillary spatial data, across a 7.2 million hectare study area.
Keywords :
geographic information systems; pattern classification; remote sensing; RF; ancillary spatial data; binary classification; ensemble margin; imbalanced training data; large area land cover classification; mislabeling; multiclass classification; noisy training data; random forests; remote sensing; Accuracy; Entropy; Noise; Radio frequency; Remote sensing; Training; Training data; classification; ensemble margin; imbalance; mislabeling; remote sensing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing (ICIP), 2014 IEEE International Conference on
Conference_Location :
Paris
Type :
conf
DOI :
10.1109/ICIP.2014.7026026
Filename :
7026026
Link To Document :
بازگشت