DocumentCode :
2977544
Title :
A Clustering Algorithm Based on Density-Grid for Stream Data
Author :
Dandan Zhang ; Hui Tian ; Yingpeng Sang ; Yidong Li ; Yanbo Wu ; Jun Wu ; Hong Shen
Author_Institution :
Sch. of Comput. & Inf. Technol., Beijing Jiaotong Univ., Beijing, China
fYear :
2012
fDate :
14-16 Dec. 2012
Firstpage :
398
Lastpage :
403
Abstract :
Many real applications, such as network traffic monitoring, intrusion detection, satellite remote sensing, and electronic business, generate data in the form of a stream arriving continuously at high speed. Clustering is an important data analysis tool for knowledge discovery. Compared with traditional clustering algorithms, clustering stream data is an important and challenging problem which has attracted many researchers. Clustering stream data is facing two main challenges. First, as the data is continuously arriving with high rate and the computer storage capacity is limited, raw data can only be scaned in one pass. Second, stream data is always changing with time, so viewing a data stream as a set of static data can deteriorate the clustering quality. In fact, users are more concerned with the evolving behaviors of clusters which can help people making correct decisions. This paper proposes a density-grid based clustering algorithm, PKS-Stream-I, for stream data. It is an optimization of PKS-Stream in density detection period selection, sporadic grid detection and removal. Empirical results show the proposed method yields out better performance.
Keywords :
grid computing; pattern clustering; PKS-Stream-I algorithm; clustering quality; data analysis tool; density detection period selection; density-grid based clustering algorithm; knowledge discovery; sporadic grid detection; sporadic grid removal; stream data clustering; Aerospace electronics; Clustering algorithms; Entropy; Indexes; Noise; Partitioning algorithms; Shape; Clustering; Index Tree; density-grid; stream data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Computing, Applications and Technologies (PDCAT), 2012 13th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-0-7695-4879-1
Type :
conf
DOI :
10.1109/PDCAT.2012.13
Filename :
6589311
Link To Document :
بازگشت