Title :
An ensemble learning approach for data stream clustering
Author :
Fathzadeh, Ramin ; Mokhtari, Vahid
Author_Institution :
Dept. of Comput. Eng., Qazvin Islamic Azad Univ., Qazvin, Iran
Abstract :
Data stream clustering is one of the most interesting issues in data mining which refers to immense of data that brought extreme restrictions to process. Ensemble Clustering has recently been paid attention as a robust method on the basis of recruiting several algorithms to analyze data and combine their results to gain more accurate analysis than every individual algorithm. Finding more accurate clusters, extract unknown structures of data and scalability are some advantages of ensemble clustering. Besides, there is no need prior knowledge about input data structure or algorithm. Accordingly, developing an ensemble clustering method to extract outstanding clusters from data stream is the theme of this article. Hence, the algorithm of Stream Ensemble Fuzzy C-Means, SEFCM, has been proposed. SEFCM comprised of three stages; 1) divide data stream to smaller blocks; 2) cluster every blocks using ensemble clustering algorithm; and 3) combine the concluding partitions and extract an absolute partition. Fulfilling experimental results of the proposed algorithm demonstrate the robustness of SEFCM to produce excellent clusters.
Keywords :
data mining; data structures; fuzzy set theory; learning (artificial intelligence); pattern clustering; SEFCM algorithm; absolute partition extraction; data mining; data stream clustering; ensemble clustering method; ensemble learning approach; input data structure; scalability; stream ensemble fuzzy C-means algorithm; Accuracy; Algorithm design and analysis; Clustering algorithms; Data mining; Indexes; Partitioning algorithms; Time complexity; SEFCM; co-occurrence matrix; data stream; ensemble clustering;
Conference_Titel :
Electrical Engineering (ICEE), 2013 21st Iranian Conference on
Conference_Location :
Mashhad
DOI :
10.1109/IranianCEE.2013.6599871