Title :
Semisupervised Kernel Feature Extraction for Remote Sensing Image Analysis
Author :
Izquierdo-Verdiguier, Emma ; Gomez-Chova, Luis ; Bruzzone, Lorenzo ; Camps-Valls, G.
Author_Institution :
Image Process. Lab. (IPL), Univ. of Valencia, València, Spain
Abstract :
This paper presents a novel semisupervised kernel partial least squares (KPLS) algorithm for nonlinear feature extraction to tackle both land-cover classification and biophysical parameter retrieval problems. The proposed method finds projections of the original input data that align with the target variable (labels) and incorporates the wealth of unlabeled information to deal with low-sized or underrepresented data sets. The method relies on combining two kernel functions: the standard radial-basis-function kernel based on labeled information and a generative, i.e., probabilistic, kernel directly learned by clustering the data many times and at different scales across the data manifold. The construction of the kernel is very simple and intuitive: Two samples should belong to the same class if they consistently belong to the same clusters at different scales. The effectiveness of the proposed method is successfully illustrated in multi- and hyperspectral remote sensing image classification and biophysical parameter estimation problems. Accuracy improvements in the range between +5% and 15% over standard principal component analysis (PCA), +4% and 15% over kernel PCA, and +3% and 10% over KPLS are obtained on several images. The average gain in the root-mean-square error of +5% and reductions in bias estimates of +3% are obtained for biophysical parameter retrieval compared to standard PCA feature extraction.
Keywords :
feature extraction; geophysical image processing; image classification; image representation; image retrieval; land cover; least squares approximations; mean square error methods; parameter estimation; pattern clustering; principal component analysis; radial basis function networks; remote sensing; KPLS algorithm; PCA; biophysical parameter estimation problem; biophysical parameter retrieval problem; data clustering; generative probabilistic kernel; hyperspectral remote sensing image classification; land-cover classification; low-sized underrepresented data set; multispectral remote sensing image classification; principal component analysis; remote sensing image analysis; root-mean-square error; semisupervised kernel feature extraction; semisupervised kernel partial least square algorithm; standard radial-basis-function kernel; Data mining; Feature extraction; Kernel; Principal component analysis; Remote sensing; Standards; Vectors; Biophysical parameter estimation; classification; clustering; feature extraction; generative kernels; kernel methods; partial least squares (PLS); principal component analysis (PCA); semisupervised learning;
Journal_Title :
Geoscience and Remote Sensing, IEEE Transactions on
DOI :
10.1109/TGRS.2013.2290372