DocumentCode :
2144370
Title :
A Double-Window-Based Classification Algorithm for Concept Drifting Data Streams
Author :
Zhu, Qun ; Hu, Xuegang ; Zhang, Yuhong ; Li, Peipei ; Wu, Xindong
Author_Institution :
Sch. of Comput. Sci. & Inf. Eng., Hefei Univ. of Technol., Hefei, China
fYear :
2010
fDate :
14-16 Aug. 2010
Firstpage :
639
Lastpage :
644
Abstract :
Tracking concept drifts in data streams has recently become a hot topic in data mining. Most of the existing work is built on a single-window-based mechanism to detect concept drifts. Due to the inherent limitation of the single-window-based mechanism, it is a challenge to handle different types of drifts. Motivated by this, a new classification algorithm based on a double-window mechanism for handling various concept drifting data streams (named DWCDS) is proposed in this paper. In terms of an ensemble classifier in random decision trees, a double-window-based mechanism is presented to detect concept drifts periodically, and the model is updated dynamically to adapt to concept drifts. Extensive studies on both synthetic and real-word data demonstrate that DWCDS could quickly and efficiently detect concept drifts from streaming data, and the performance on the robustness to noise and the accuracy of classification is also improved significantly.
Keywords :
data mining; decision trees; pattern classification; concept drift tracking; concept drifting data stream; data handling; data mining; double window-based classification algorithm; random decision trees; single-window-based mechanism; Accuracy; Classification algorithms; Classification tree analysis; Databases; Error analysis; Noise; Robustness; classification; concept drift; data stream;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Granular Computing (GrC), 2010 IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4244-7964-1
Type :
conf
DOI :
10.1109/GrC.2010.125
Filename :
5576021
Link To Document :
بازگشت