• DocumentCode
    80007
  • Title

    Toward a Semiautomatic Machine Learning Retrieval of Biophysical Parameters

  • Author

    Rivera Caicedo, Juan Pablo ; Verrelst, Jochem ; Munoz-Mari, Jordi ; Moreno, J. ; Camps-Valls, G.

  • Author_Institution
    Image Process. Lab. (IPL), Univ. de Valencia, València, Spain
  • Volume
    7
  • Issue
    4
  • fYear
    2014
  • fDate
    Apr-14
  • Firstpage
    1249
  • Lastpage
    1259
  • Abstract
    Biophysical parameters such as leaf chlorophyll content (LCC) and leaf area index (LAI) are standard vegetation products that can be retrieved from Earth observation imagery. This paper introduces a new machine learning regression algorithms (MLRAs) toolbox into the scientific Automated Radiative Transfer Models Operator (ARTMO) software package. ARTMO facilitates retrieval of biophysical parameters from remote observations in a MATLAB graphical user interface (GUI) environment. The MLRA toolbox enables analyzing the predictive power of various MLRAs in a semiautomatic and systematic manner, and applying a selected MLRA to multispectral or hyperspectral imagery for mapping applications. It contains both linear and nonlinear state-of-the-art regression algorithms, in particular linear feature extraction via principal component regression (PCR), partial least squares regression (PLSR), decision trees (DTs), neural networks (NNs), kernel ridge regression (KRR), and Gaussian processes regression (GPR). The performance of multiple implemented regression strategies has been evaluated against the SPARC dataset (Barrax, Spain) and simulated Sentinel-2 (8 bands), CHRIS (62 bands) and HyMap (125 bands) observations. In general, nonlinear regression algorithms (NN, KRR, and GPR) outperformed linear techniques (PCR and PLSR) in terms of accuracy, bias, and robustness. Most robust results along gradients of training/validation partitioning and noise variance were obtained by KRR while GPR delivered most accurate estimations. We applied a GPR model to a hyperspectral HyMap flightline to map LCC and LAI. We exploited the associated uncertainty intervals to gain insight in the per-pixel performance of the model.
  • Keywords
    Gaussian processes; decision trees; feature extraction; geophysical image processing; graphical user interfaces; hyperspectral imaging; learning (artificial intelligence); least squares approximations; mathematics computing; neural nets; principal component analysis; regression analysis; vegetation mapping; CHRIS observation; Gaussian processes regression; MATLAB graphical user interface; SPARC dataset; biophysical parameters; decision trees; hyperspectral HyMap flightline; hyperspectral imagery; kernel ridge regression; leaf area index; leaf chlorophyll content; linear feature extraction; machine learning regression algorithms; multispectral imagery; neural networks; noise variance; nonlinear regression algorithms; partial least squares regression; partitioning; principal component regression; scientific Automated Radiative Transfer Models Operator software package; semiautomatic machine learning retrieval; simulated Sentinel-2 observation; vegetation; Artificial neural networks; Biological system modeling; Graphical user interfaces; Ground penetrating radar; Kernel; Remote sensing; Training; Biophysical parameter retrieval; CHRIS; HyMap; Sentinel-2 (S2); graphical user interface (GUI) toolbox; leaf area index (LAI); leaf chlorophyll content (LCC); machine learning; nonparametric regression;
  • fLanguage
    English
  • Journal_Title
    Selected Topics in Applied Earth Observations and Remote Sensing, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    1939-1404
  • Type

    jour

  • DOI
    10.1109/JSTARS.2014.2298752
  • Filename
    6727447