Title :
Cluster cores-based clustering for high dimensional data
Author :
Shen, Yi-Dong ; Shen, Zhi-Yong ; Zhang, Shi-Ming ; Yang, Qiang
Author_Institution :
Inst. of Software, Chinese Acad. of Sci., Beijing, China
Abstract :
We propose a new approach to clustering high dimensional data based on a novel notion of cluster cores, instead of on nearest neighbors. A cluster core is a fairly dense group with a maximal number of pairwise similar objects. It represents the core of a cluster, as all objects in a cluster are with a great degree attracted to it. As a result, building clusters from cluster cores achieves high accuracy. Other major characteristics of the approach include: (1) It uses a semantics-based similarity measure. (2) It does not incur the curse of dimensionality and is scalable linearly with the dimensionality of data. (3) It outperforms the well-known clustering algorithm, ROCK, with both lower time complexity and higher accuracy.
Keywords :
computational complexity; data mining; cluster cores-based clustering; clustering algorithm; curse of dimensionality; high dimensional data; semantics-based similarity measure; time complexity; Clustering algorithms; Clustering methods; Computer science; Data mining; Euclidean distance; Laboratories; Nearest neighbor searches; Robustness; Time measurement;
Conference_Titel :
Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
Print_ISBN :
0-7695-2142-8
DOI :
10.1109/ICDM.2004.10045