Title :
A Grid and Density-Based Clustering Algorithm for Processing Data Stream
Author :
Jia, Chen ; Tan, Chengyu ; Yong, Ai
Author_Institution :
State Key Lab. of Software Eng., Wuhan Univ., Wuhan
Abstract :
This paper proposes DD-Stream, a framework for density-based clustering stream data. The algorithm adopts a density decaying technique to capture the evolving data stream and extracts the boundary point of grid by the DCQ-means algorithm. Our method resolving the problem of evolving automatic clustering of real-time data streams, cannot only find arbitrary shaped clusters with noise, but also avoid the clustering quality problems caused by discarding the boundary point of grid, our algorithm has better scalability in processing large-scale and high dimensional stream data as well.
Keywords :
computational geometry; data mining; pattern clustering; very large databases; DCQ-means algorithm; DD-stream; boundary point extraction; data stream mining; density decaying technique; density-based clustering algorithm; grid clustering algorithm; high-dimensional data stream; large-scale data stream processing; Clustering algorithms; Costs; Data mining; Genetics; Grid computing; Laboratories; Noise shaping; Partitioning algorithms; Shape; Software algorithms; Cluster; Data stream; Grid;
Conference_Titel :
Genetic and Evolutionary Computing, 2008. WGEC '08. Second International Conference on
Conference_Location :
Hubei
Print_ISBN :
978-0-7695-3334-6
DOI :
10.1109/WGEC.2008.32