• DocumentCode
    2651525
  • Title

    A Micro-cluster Based Ensemble Approach for Classifying Distributed Data Streams

  • Author

    Mao, Guojun ; Yang, Yi

  • Author_Institution
    Sch. of Inf. Technol., Beijing Univ. of Technol., Beijing, China
  • fYear
    2011
  • fDate
    7-9 Nov. 2011
  • Firstpage
    753
  • Lastpage
    759
  • Abstract
    Mining distributed data streams is a focus of much research in recent years, and it has brought many challenging problems. One of these problems is just learning and maintaining the global patterns from multiple data streams in distributed environments. In this paper, we discuss micro-cluster based classifying problems in distributed data streams, and propose the methods to mine data streams in the distributed environments oriented to both labeled and unlabeled data. For each local site, local micro-cluster based ensemble is used and its updating algorithms are designed. Making use of the time-based sliding window techniques, the local models in a fixed time-span are transferred to a central site after being generated in all local sites, and then the global patterns related to this time-span can be mined in the central site. In our methods, the global patterns are micro-cluster based rather than typical classifiers such decision trees, which can get expected classification accuracy when higher mining performance is assured. The experiment shows these methods are effective and efficient to classify multiple data streams in distributed environments.
  • Keywords
    data mining; decision trees; pattern classification; pattern clustering; decision trees; distributed data stream mining; distributed data streams classification; microcluster based classification problems; microcluster based ensemble approach; time based sliding window techniques; Accuracy; Classification algorithms; Clustering algorithms; Data mining; Data models; Decision trees; Distributed databases; distributed data stream; ensemble classifiing; micro-cluster; model combination;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence (ICTAI), 2011 23rd IEEE International Conference on
  • Conference_Location
    Boca Raton, FL
  • ISSN
    1082-3409
  • Print_ISBN
    978-1-4577-2068-0
  • Electronic_ISBN
    1082-3409
  • Type

    conf

  • DOI
    10.1109/ICTAI.2011.118
  • Filename
    6103409