Title :
CUBN: A clustering algorithm based on density and distance
Author :
Wang, Li ; Wang, Zheng-Ou
Author_Institution :
Inst. of Syst. Eng., Tianjin Univ., China
Abstract :
In data mining, clustering is used to discover groups and identify interesting distribution on the underlying data. Traditional clustering algorithm favors clusters with spherical shapes and similar sizes. We propose a new clustering algorithm called CUBN that integrates density-based and distance-based clustering. Firstly CUBN finds border points by using erosion operation that is one of the basic operations in mathematical morphology, then, it clusters the border points and inner points according to the nearest distance. Our experimental results show that CUBN can identify clusters having non-spherical shapes and wide variances in size, and its computational complexity is O(n). Therefore, this algorithm facilitates the clustering of a very large data set.
Keywords :
computational complexity; data mining; mathematical morphology; pattern clustering; CUBN; border points; clustering using border and nearest; computational complexity; data mining; density-based clustering; distance-based clustering; erosion operation; inner points; mathematical morphology; spherical shapes; Clustering algorithms; Clustering methods; Computational complexity; Data engineering; Data mining; Machine learning; Mathematics; Partitioning algorithms; Shape; Systems engineering and theory;
Conference_Titel :
Machine Learning and Cybernetics, 2003 International Conference on
Print_ISBN :
0-7803-8131-9
DOI :
10.1109/ICMLC.2003.1264452