DocumentCode :
3007789
Title :
A Grid and Density-Based Clustering Algorithm for Processing Data Stream
Author :
Jia, Chen ; Tan, Chengyu ; Yong, Ai
Author_Institution :
State Key Lab. of Software Eng., Wuhan Univ., Wuhan
fYear :
2008
fDate :
25-26 Sept. 2008
Firstpage :
517
Lastpage :
521
Abstract :
This paper proposes DD-Stream, a framework for density-based clustering stream data. The algorithm adopts a density decaying technique to capture the evolving data stream and extracts the boundary point of grid by the DCQ-means algorithm. Our method resolving the problem of evolving automatic clustering of real-time data streams, cannot only find arbitrary shaped clusters with noise, but also avoid the clustering quality problems caused by discarding the boundary point of grid, our algorithm has better scalability in processing large-scale and high dimensional stream data as well.
Keywords :
computational geometry; data mining; pattern clustering; very large databases; DCQ-means algorithm; DD-stream; boundary point extraction; data stream mining; density decaying technique; density-based clustering algorithm; grid clustering algorithm; high-dimensional data stream; large-scale data stream processing; Clustering algorithms; Costs; Data mining; Genetics; Grid computing; Laboratories; Noise shaping; Partitioning algorithms; Shape; Software algorithms; Cluster; Data stream; Grid;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Genetic and Evolutionary Computing, 2008. WGEC '08. Second International Conference on
Conference_Location :
Hubei
Print_ISBN :
978-0-7695-3334-6
Type :
conf
DOI :
10.1109/WGEC.2008.32
Filename :
4637498
Link To Document :
بازگشت