DocumentCode
2554697
Title
An Efficient Clustering Algorithm for 2D Multi-density Dataset in Large Database
Author
Xia, Ying ; Wang, Guoyin ; Gao, Song
Author_Institution
Southwest Jiaotong Univ., Chengdu
fYear
2007
fDate
26-28 April 2007
Firstpage
78
Lastpage
82
Abstract
Spatial clustering is an important component of spatial data mining. The requirement of detecting clusters of points arises in many applications. One of the challenges in spatial clustering is to find clusters on multi-density dataset. In this paper, a grid-based density-confidence-interval clustering algorithm for 2-dimensional multi-density dataset is proposed, called GDCIC. The proposed algorithm combines the density confidence interval with grid-based clustering, and produces accurate density estimation in local areas for local density thresholds. Local dense areas are distinguished from sparse areas or outliers according to these thresholds. Experiments based on both synthetic and real datasets verify that the algorithm is efficiently for multi-data sets and handle outliers effectively.
Keywords
data mining; grid computing; pattern clustering; very large databases; 2D multidensity dataset; clustering algorithm; grid-based density-confidence-interval clustering algorithm; large database; local density thresholds; spatial clustering; spatial data mining; Application software; Clustering algorithms; Computer science; Data analysis; Data mining; Equations; Information science; Partitioning algorithms; Sampling methods; Spatial databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Ubiquitous Engineering, 2007. MUE '07. International Conference on
Conference_Location
Seoul
Print_ISBN
0-7695-2777-9
Type
conf
DOI
10.1109/MUE.2007.67
Filename
4197253
Link To Document