Title :
A Micro-cluster Based Ensemble Approach for Classifying Distributed Data Streams
Author :
Mao, Guojun ; Yang, Yi
Author_Institution :
Sch. of Inf. Technol., Beijing Univ. of Technol., Beijing, China
Abstract :
Mining distributed data streams is a focus of much research in recent years, and it has brought many challenging problems. One of these problems is just learning and maintaining the global patterns from multiple data streams in distributed environments. In this paper, we discuss micro-cluster based classifying problems in distributed data streams, and propose the methods to mine data streams in the distributed environments oriented to both labeled and unlabeled data. For each local site, local micro-cluster based ensemble is used and its updating algorithms are designed. Making use of the time-based sliding window techniques, the local models in a fixed time-span are transferred to a central site after being generated in all local sites, and then the global patterns related to this time-span can be mined in the central site. In our methods, the global patterns are micro-cluster based rather than typical classifiers such decision trees, which can get expected classification accuracy when higher mining performance is assured. The experiment shows these methods are effective and efficient to classify multiple data streams in distributed environments.
Keywords :
data mining; decision trees; pattern classification; pattern clustering; decision trees; distributed data stream mining; distributed data streams classification; microcluster based classification problems; microcluster based ensemble approach; time based sliding window techniques; Accuracy; Classification algorithms; Clustering algorithms; Data mining; Data models; Decision trees; Distributed databases; distributed data stream; ensemble classifiing; micro-cluster; model combination;
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2011 23rd IEEE International Conference on
Conference_Location :
Boca Raton, FL
Print_ISBN :
978-1-4577-2068-0
Electronic_ISBN :
1082-3409
DOI :
10.1109/ICTAI.2011.118