Title :
An Active Learning Method for Mining Time-Changing Data Streams
Author_Institution :
Sch. of Comput. Sci. & Eng., JiangSu Univ. of Sci. & Technol., Zhenjiang
Abstract :
Many applications generate continuous, time-changing data streams. Mining it for an adaptive classifier is of great interest and challenge. Many previous efforts impractically assume the labeled data is available and can be mined at anytime. In this paper, we propose an effective active learning method to mine time-changing data streams efficiently. It designs a way to monitoring the possible changes on the fly without need knowing the labeled data. Upon the suspected changes are indicated, it employs a light-weight uncertainty sampling algorithm to choose the most informative instances to label. With these representative labeled instances, it tests the significance of the suspected changes. If the changes indeed cause significant performance deterioration of the current classifier, it reconstructs the old model. Thus, our method can reliably detect significant changes, quickly adapt to concept-drift, and result effective models. Experimental results from real-world data confirm the advantages of our method.
Keywords :
data mining; database management systems; learning (artificial intelligence); pattern classification; active learning method; adaptive classifying model; light-weight uncertainty sampling algorithm; representative labeled instance; time-changing data stream mining; Application software; Costs; Data mining; Decision trees; Frequency; Learning systems; Monitoring; Sampling methods; Statistical distributions; Training data;
Conference_Titel :
Intelligent Information Technology Application, 2008. IITA '08. Second International Symposium on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3497-8
DOI :
10.1109/IITA.2008.136