DocumentCode
2765777
Title
A fast clustering algorithm based on grid and density
Author
Sun, Zhiwei ; Zhao, Zheng ; Wang, Hongmei ; Ma, Maode ; Zhang, Lianfang ; Shu, Yantai
Author_Institution
Sch. of Electron. & Inf. Eng., Tianjin Univ.
fYear
2005
fDate
1-4 May 2005
Firstpage
2276
Lastpage
2279
Abstract
The efficiency of data mining algorithms is a very important issue as data becoming larger and larger. Density-based clustering analysis can discover clusters with arbitrary shape and is insensitive to noise data. The advantage of grid-based clustering method is linear time complexity. In this paper, we present a new clustering algorithm CLUGD relying on grid and density. We first construct a grid of relevant portion. Then the algorithm finds references by grid and classifies these references to core references and bound references. Then it attaches the data of the bound references to the nearest core references and aggregation the core references in neighboring portions. At last, in-direct graph is used to classify these core references and maps cluster to original data. We performed an experimental evaluation of effectiveness and efficiency of CLUGD using synthetic data and the data of the SEQUOIA 2000 Benchmark. Both theory analysis and experimental results confirm that CLUGD can discover clusters with arbitrary shape and is insensitive to noise data. In the meanwhile, its executing efficiency is much higher than DBSCAN algorithm based on R*-tree
Keywords
data mining; graph theory; pattern clustering; CLUGD; R*-tree; SEQUOIA 2000 Benchmark; bound references; core references; data mining algorithms; density-based clustering analysis; fast clustering algorithm; grid-based clustering method; in-direct graph; linear time complexity; Algorithm design and analysis; Clustering algorithms; Clustering methods; Data engineering; Data mining; Noise shaping; Partitioning algorithms; Performance evaluation; Shape; Sun;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical and Computer Engineering, 2005. Canadian Conference on
Conference_Location
Saskatoon, Sask.
ISSN
0840-7789
Print_ISBN
0-7803-8885-2
Type
conf
DOI
10.1109/CCECE.2005.1557443
Filename
1557443
Link To Document