• DocumentCode
    2740926
  • Title

    Evalution of Random Forest Ensemble Classification for Land Cover Mapping Using TM and Ancillary Geographical Data

  • Author

    Na, Xiaodong ; Zang, Shuying ; Wang, Jianhua

  • Author_Institution
    Inst. of Comput. Sci. & Inf. Eng., Harbin Normal Univ., Harbin, China
  • Volume
    2
  • fYear
    2009
  • fDate
    14-16 Aug. 2009
  • Firstpage
    89
  • Lastpage
    93
  • Abstract
    Large area land cover mapping, involving large volumes of data, are becoming more prevalent in remote sensing applications. Thus there is a pressing need for increased automation in the land cover mapping process. The main objective of this research was to map land cover in the Small Sanjiang Plain where marsh distributed concentively combined Landsat TM imagery with ancillary geographical data and compare the performance of three machine learning algrithms (MLAs) including random forest (RF), classification and regression tree (CART) and maximum likelihood classification (MLC). Comparisions were based on several criteria: overall accuracy, sensitivity to data set size and noise. Our results indicated that (1) Random Forest can achieve substantial improvements in accuracy over single classification trees and traditional MLC method, overall accuracy was 91.0%, kappa coefficient was 0.8943, with marsh class accuracy ranging from 77.4% to 90.0%; (2) Random forest was least sensitive to reduction in training sample size and it was most resistant to the presence of noise compared to CART and MLC. The comparison result revealed that random forest has potential to increase automation in large area land cover mapping while achieving reasonable map accuracy.
  • Keywords
    data analysis; forestry; geography; learning (artificial intelligence); pattern classification; regression analysis; terrain mapping; CART; Landsat TM imagery; ancillary geographical data; classification and regression tree; data set size sensitivity; kappa coefficient; land cover mapping; machine learning algrithms; marsh; maximum likelihood classification; noise sensitivity; random forest ensemble classification; remote sensing applications; small Sanjiang plain; Automation; Classification tree analysis; Computer science; Ecosystems; Fuzzy systems; Optical sensors; Regression tree analysis; Remote monitoring; Remote sensing; Satellites; Landsat TM; ancillary geographical data; land cover classification; random forest;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-0-7695-3735-1
  • Type

    conf

  • DOI
    10.1109/FSKD.2009.165
  • Filename
    5358601