Title :
Self-adaptive clustering data stream algorithm based on SSMC-tree
Author :
Kehua Yang ; Heqing Gao ; Lin Chen ; Qiong Yuan
Author_Institution :
Lab. of Embedded Syst. & Network, Hunan Univ., Changsha, China
Abstract :
Due to the data stream is real-time, fast, unlimited, one-pass, clustering data stream requires algorithms which are capable to process the data stream in the limited time and memory. In this paper, we propose a clustering algorithm based on the improved similarity search tree (SSMC-Tree), and introduce buffer, hitchhike processing and local aggregation strategy, it can adapt to different speed data stream. We adopt an outlier processing mechanism by introducing potential core-micro-cluster buffer and outlier micro-cluster buffer to process noise in the data stream. Experimental results show that our algorithm can adapt to the high-speed data stream with noise.
Keywords :
pattern clustering; trees (mathematics); SSMC-tree; high-speed data stream; hitchhike processing; local aggregation strategy; outlier microcluster buffer; outlier processing mechanism; potential core-microcluster buffer; self-adaptive data stream clustering algorithm; simiiarity search with microclusters tree; Clustering algorithms; Data Mining; Data Stream; Hierarchic Clustering;
Conference_Titel :
Software Engineering and Service Science (ICSESS), 2013 4th IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-4997-0
DOI :
10.1109/ICSESS.2013.6615320