DocumentCode :
3060308
Title :
Tracking recurrent concept drift in streaming data using ensemble classifiers
Author :
Ramamurthy, Sasthakumar ; Bhatnagar, Raj
Author_Institution :
Univ. of Cincinnati, Cincinnati
fYear :
2007
fDate :
13-15 Dec. 2007
Firstpage :
404
Lastpage :
409
Abstract :
Streaming data may consist of multiple drifting concepts each having its own underlying data distribution. We present an ensemble learning based approach to handle the data streams having multiple underlying modes. We build a global set of classifiers from sequential data chunks; ensembles are then selected from this global set of classifiers, and new classifiers created if needed, to represent the current concept in the stream. The system is capable of performing any-time classification and to detect concept drift in the stream. In streaming data historic concepts are likely to reappear so we don´t delete any of the historic classifiers. Instead, we judiciously select only pertinent classifiers from the global set while forming the ensemble set for a classification task.
Keywords :
data mining; learning (artificial intelligence); pattern classification; any-time classification; data distribution; data stream handling; ensemble classifiers; ensemble learning based approach; recurrent concept drift tracking; stream data mining; Application software; Computer science; Data flow computing; Data mining; Decision trees; Environmental economics; Information filtering; Machine learning; Statistics; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications, 2007. ICMLA 2007. Sixth International Conference on
Conference_Location :
Cincinnati, OH
Print_ISBN :
978-0-7695-3069-7
Type :
conf
DOI :
10.1109/ICMLA.2007.80
Filename :
4457264
Link To Document :
بازگشت