DocumentCode :
3499695
Title :
Semi-supervised learning in nonstationary environments
Author :
Ditzler, Gregory ; Polikar, Robi
Author_Institution :
Electr. & Comput. Eng. Dept., Rowan Univ., Glassboro, NJ, USA
fYear :
2011
fDate :
July 31 2011-Aug. 5 2011
Firstpage :
2741
Lastpage :
2748
Abstract :
Learning in nonstationary environments, also called learning concept drift, has been receiving increasing attention due to increasingly large number of applications that generate data with drifting distributions. These applications are usually associated with streaming data, either online or in batches, and concept drift algorithms are trained to detect and track the drifting concepts. While concept drift itself is a significantly more complex problem than the traditional machine learning paradigm of data coming from a fixed distribution, the problem is further complicated when obtaining labeled data is expensive, and training must rely, in part, on unlabelled data. Independently from concept drift research, semi-supervised approaches have been developed for learning from (limited) labeled and (abundant) unlabeled data; however, such approaches have been largely absent in concept drift literature. In this contribution, we describe an ensemble of classifiers based approach that takes advantage of both labeled and unlabeled data in addressing concept drift: available labeled data are used to generate classifiers, whose voting weights are determined based on the distances between Gaussian mixture model components trained on both labeled and unlabeled data in a drifting environment.
Keywords :
Gaussian processes; data analysis; learning (artificial intelligence); pattern classification; Gaussian mixture model; data classifier; drifting distribution; learning concept drift; machine learning; nonstationary environment; semisupervised learning; streaming data; unlabelled data; voting weight; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Data models; Testing; Training; Training data; concept drift; ensemble systems; incremental learning; non-stationary environments; unlabeled data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), The 2011 International Joint Conference on
Conference_Location :
San Jose, CA
ISSN :
2161-4393
Print_ISBN :
978-1-4244-9635-8
Type :
conf
DOI :
10.1109/IJCNN.2011.6033578
Filename :
6033578
Link To Document :
بازگشت