DocumentCode :
2423471
Title :
A Fast Subspace Clustering Algorithm Based on Pattern Similarity
Author :
Gan, Yanglan ; Guan, Jihong ; Wang, Hao
Author_Institution :
Tongji Univ., Shanghai
Volume :
3
fYear :
2007
fDate :
24-27 Aug. 2007
Firstpage :
253
Lastpage :
257
Abstract :
Traditional clustering models define similarity by distance over dimensions. However, distance functions are not always adequate in capturing correlations among the objects. Pattern-based clustering can discover this kind of clusters. But state-of-the-art pattern-based clustering methods are inefficient and haven´t criteria to evaluate the quality of clusters. This paper presents a novel pattern similarity-based subspace clustering with the pattern tree (PPSC for short) that offers these capabilities. The method uses new evaluation criteria to discover best clusters, which enables user to find clusters according to different needs. Meanwhile, observing the analogy between mining frequent itemsets and discovering subspace clusters around random points, we apply the pattern-tree to determine subspace by scanning the database once, so it can perform efficiently in large datasets.
Keywords :
pattern clustering; set theory; pattern similarity; pattern-based clustering; random points; similarity-based subspace clustering; Clustering algorithms; Clustering methods; Computer science; Data mining; Databases; Fuzzy systems; Gallium nitride; Itemsets; Large-scale systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2874-8
Type :
conf
DOI :
10.1109/FSKD.2007.24
Filename :
4406239
Link To Document :
بازگشت