DocumentCode
80007
Title
Toward a Semiautomatic Machine Learning Retrieval of Biophysical Parameters
Author
Rivera Caicedo, Juan Pablo ; Verrelst, Jochem ; Munoz-Mari, Jordi ; Moreno, J. ; Camps-Valls, G.
Author_Institution
Image Process. Lab. (IPL), Univ. de Valencia, València, Spain
Volume
7
Issue
4
fYear
2014
fDate
Apr-14
Firstpage
1249
Lastpage
1259
Abstract
Biophysical parameters such as leaf chlorophyll content (LCC) and leaf area index (LAI) are standard vegetation products that can be retrieved from Earth observation imagery. This paper introduces a new machine learning regression algorithms (MLRAs) toolbox into the scientific Automated Radiative Transfer Models Operator (ARTMO) software package. ARTMO facilitates retrieval of biophysical parameters from remote observations in a MATLAB graphical user interface (GUI) environment. The MLRA toolbox enables analyzing the predictive power of various MLRAs in a semiautomatic and systematic manner, and applying a selected MLRA to multispectral or hyperspectral imagery for mapping applications. It contains both linear and nonlinear state-of-the-art regression algorithms, in particular linear feature extraction via principal component regression (PCR), partial least squares regression (PLSR), decision trees (DTs), neural networks (NNs), kernel ridge regression (KRR), and Gaussian processes regression (GPR). The performance of multiple implemented regression strategies has been evaluated against the SPARC dataset (Barrax, Spain) and simulated Sentinel-2 (8 bands), CHRIS (62 bands) and HyMap (125 bands) observations. In general, nonlinear regression algorithms (NN, KRR, and GPR) outperformed linear techniques (PCR and PLSR) in terms of accuracy, bias, and robustness. Most robust results along gradients of training/validation partitioning and noise variance were obtained by KRR while GPR delivered most accurate estimations. We applied a GPR model to a hyperspectral HyMap flightline to map LCC and LAI. We exploited the associated uncertainty intervals to gain insight in the per-pixel performance of the model.
Keywords
Gaussian processes; decision trees; feature extraction; geophysical image processing; graphical user interfaces; hyperspectral imaging; learning (artificial intelligence); least squares approximations; mathematics computing; neural nets; principal component analysis; regression analysis; vegetation mapping; CHRIS observation; Gaussian processes regression; MATLAB graphical user interface; SPARC dataset; biophysical parameters; decision trees; hyperspectral HyMap flightline; hyperspectral imagery; kernel ridge regression; leaf area index; leaf chlorophyll content; linear feature extraction; machine learning regression algorithms; multispectral imagery; neural networks; noise variance; nonlinear regression algorithms; partial least squares regression; partitioning; principal component regression; scientific Automated Radiative Transfer Models Operator software package; semiautomatic machine learning retrieval; simulated Sentinel-2 observation; vegetation; Artificial neural networks; Biological system modeling; Graphical user interfaces; Ground penetrating radar; Kernel; Remote sensing; Training; Biophysical parameter retrieval; CHRIS; HyMap; Sentinel-2 (S2); graphical user interface (GUI) toolbox; leaf area index (LAI); leaf chlorophyll content (LCC); machine learning; nonparametric regression;
fLanguage
English
Journal_Title
Selected Topics in Applied Earth Observations and Remote Sensing, IEEE Journal of
Publisher
ieee
ISSN
1939-1404
Type
jour
DOI
10.1109/JSTARS.2014.2298752
Filename
6727447
Link To Document