Title :
An Efficient Clustering Algorithm for 2D Multi-density Dataset in Large Database
Author :
Xia, Ying ; Wang, Guoyin ; Gao, Song
Author_Institution :
Southwest Jiaotong Univ., Chengdu
Abstract :
Spatial clustering is an important component of spatial data mining. The requirement of detecting clusters of points arises in many applications. One of the challenges in spatial clustering is to find clusters on multi-density dataset. In this paper, a grid-based density-confidence-interval clustering algorithm for 2-dimensional multi-density dataset is proposed, called GDCIC. The proposed algorithm combines the density confidence interval with grid-based clustering, and produces accurate density estimation in local areas for local density thresholds. Local dense areas are distinguished from sparse areas or outliers according to these thresholds. Experiments based on both synthetic and real datasets verify that the algorithm is efficiently for multi-data sets and handle outliers effectively.
Keywords :
data mining; grid computing; pattern clustering; very large databases; 2D multidensity dataset; clustering algorithm; grid-based density-confidence-interval clustering algorithm; large database; local density thresholds; spatial clustering; spatial data mining; Application software; Clustering algorithms; Computer science; Data analysis; Data mining; Equations; Information science; Partitioning algorithms; Sampling methods; Spatial databases;
Conference_Titel :
Multimedia and Ubiquitous Engineering, 2007. MUE '07. International Conference on
Conference_Location :
Seoul
Print_ISBN :
0-7695-2777-9
DOI :
10.1109/MUE.2007.67