DocumentCode
2265064
Title
An Active Learning Method for Mining Time-Changing Data Streams
Author
Huang, Shucheng
Author_Institution
Sch. of Comput. Sci. & Eng., JiangSu Univ. of Sci. & Technol., Zhenjiang
Volume
2
fYear
2008
fDate
20-22 Dec. 2008
Firstpage
548
Lastpage
552
Abstract
Many applications generate continuous, time-changing data streams. Mining it for an adaptive classifier is of great interest and challenge. Many previous efforts impractically assume the labeled data is available and can be mined at anytime. In this paper, we propose an effective active learning method to mine time-changing data streams efficiently. It designs a way to monitoring the possible changes on the fly without need knowing the labeled data. Upon the suspected changes are indicated, it employs a light-weight uncertainty sampling algorithm to choose the most informative instances to label. With these representative labeled instances, it tests the significance of the suspected changes. If the changes indeed cause significant performance deterioration of the current classifier, it reconstructs the old model. Thus, our method can reliably detect significant changes, quickly adapt to concept-drift, and result effective models. Experimental results from real-world data confirm the advantages of our method.
Keywords
data mining; database management systems; learning (artificial intelligence); pattern classification; active learning method; adaptive classifying model; light-weight uncertainty sampling algorithm; representative labeled instance; time-changing data stream mining; Application software; Costs; Data mining; Decision trees; Frequency; Learning systems; Monitoring; Sampling methods; Statistical distributions; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Information Technology Application, 2008. IITA '08. Second International Symposium on
Conference_Location
Shanghai
Print_ISBN
978-0-7695-3497-8
Type
conf
DOI
10.1109/IITA.2008.136
Filename
4739824
Link To Document