Title :
A Double-Window-Based Classification Algorithm for Concept Drifting Data Streams
Author :
Zhu, Qun ; Hu, Xuegang ; Zhang, Yuhong ; Li, Peipei ; Wu, Xindong
Author_Institution :
Sch. of Comput. Sci. & Inf. Eng., Hefei Univ. of Technol., Hefei, China
Abstract :
Tracking concept drifts in data streams has recently become a hot topic in data mining. Most of the existing work is built on a single-window-based mechanism to detect concept drifts. Due to the inherent limitation of the single-window-based mechanism, it is a challenge to handle different types of drifts. Motivated by this, a new classification algorithm based on a double-window mechanism for handling various concept drifting data streams (named DWCDS) is proposed in this paper. In terms of an ensemble classifier in random decision trees, a double-window-based mechanism is presented to detect concept drifts periodically, and the model is updated dynamically to adapt to concept drifts. Extensive studies on both synthetic and real-word data demonstrate that DWCDS could quickly and efficiently detect concept drifts from streaming data, and the performance on the robustness to noise and the accuracy of classification is also improved significantly.
Keywords :
data mining; decision trees; pattern classification; concept drift tracking; concept drifting data stream; data handling; data mining; double window-based classification algorithm; random decision trees; single-window-based mechanism; Accuracy; Classification algorithms; Classification tree analysis; Databases; Error analysis; Noise; Robustness; classification; concept drift; data stream;
Conference_Titel :
Granular Computing (GrC), 2010 IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4244-7964-1
DOI :
10.1109/GrC.2010.125