Title :
A semi-supervised learning method for remote sensing data mining
Author :
Vatsavai, Ranga Raju ; Shekhar, Shashi ; Burk, Thomas E.
Author_Institution :
Dept. of Comput. Sci. & Eng., Minnesota Univ., Minneapolis, MN
Abstract :
New approaches are needed to extract useful patterns from increasingly large multi-spectral remote sensing image databases in order to understand global climatic changes, vegetation dynamics, ocean processes, etc. Supervised learning, which is often used in land cover (thematic) classification of remote sensing imagery, requires large amounts of accurate training data. However, in many situations it is very difficult to collect labels for all training samples. In this paper we explore methods that utilize unlabeled samples in supervised learning for thematic information extraction from remote sensing imagery. Our objectives are to understand the impact of parameter estimation with small learning samples on classification accuracy, and to augment the parameter estimation with unlabeled training samples to improve land cover predictions. We have developed a semi-supervised learning method based on the expectation-maximization (EM) algorithm, and maximum likelihood and maximum a posteriori classifiers. This scheme utilizes a small set of labeled and a large number of unlabeled training samples. We have conducted several experiments on multi-spectral images to understand the impact of unlabeled samples on the classification performance. Our study shows that though in general classification accuracy improves with the addition of unlabeled training samples, it is not guaranteed to get consistently higher accuracies unless sufficient care is exercised when designing a semi-supervised classifier
Keywords :
data mining; expectation-maximisation algorithm; learning (artificial intelligence); parameter estimation; pattern classification; visual databases; expectation-maximization algorithm; land cover classification; land cover predictions; maximum a posteriori classifier; maximum likelihood classifier; multispectral remote sensing image databases; parameter estimation; pattern extraction; remote sensing data mining; semisupervised learning; thematic information extraction; unlabeled training samples; Data mining; Image databases; Maximum likelihood estimation; Oceans; Parameter estimation; Remote sensing; Semisupervised learning; Supervised learning; Training data; Vegetation mapping; EM; GMM; MAP; MLE; semi-supervised learning;
Conference_Titel :
Tools with Artificial Intelligence, 2005. ICTAI 05. 17th IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2488-5
DOI :
10.1109/ICTAI.2005.17