DocumentCode :
3107484
Title :
Speedup Clustering with Hierarchical Ranking
Author :
Zhou, Jianjun ; Sander, Joerg
Author_Institution :
Dept. of Comput. Sci., Univ. of Alberta, Edmonton, AB
fYear :
2006
fDate :
18-22 Dec. 2006
Firstpage :
1205
Lastpage :
1210
Abstract :
Many clustering algorithms in particular hierarchical clustering algorithms do not scale-up well for large data-sets especially when using an expensive distance function. In this paper, we propose a novel approach to perform approximate clustering with high accuracy. We introduce the concept of a pairwise hierarchical ranking to efficiently determine close neighbors for every data object. Empirical results on synthetic and real-life data show a speedup of up to two orders of magnitude over OPTICS while maintaining a high accuracy and up to one order of magnitude over the previously proposed DATA BUBBLES method, which also tries to speedup OPTICS by trading accuracy for speed.
Keywords :
data mining; pattern clustering; OPTICS; approximate clustering; data bubbles method; data mining; expensive distance function; hierarchical clustering algorithm; pairwise hierarchical ranking; speedup clustering; Clustering algorithms; Computer applications; Extraterrestrial measurements; Indexing; Optical computing; Optical distortion; Proteins; Runtime; Sampling methods; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location :
Hong Kong
ISSN :
1550-4786
Print_ISBN :
0-7695-2701-7
Type :
conf
DOI :
10.1109/ICDM.2006.151
Filename :
4053180
Link To Document :
بازگشت