DocumentCode
1938411
Title
A Multi-Relational Hierarchical Clustering Algorithm Based on Shared Nearest Neighbor Similarity
Author
Guo, Jing-Feng ; Zhao, Yu-Yan ; Li, Jing
Volume
7
fYear
2007
fDate
19-22 Aug. 2007
Firstpage
3951
Lastpage
3955
Abstract
The clustering about relational databases is an active study subject in data mining. In this paper, we introduce a multi-relational hierarchical clustering algorithm based on shared nearest neighbor similarity (MHSNNS). First, this algorithm joins every table through the tuple 1D propagation. Then, groups objects into a large number of relatively small sub-clusters using the shared nearest neighbor algorithm and the cluster cohesion. Last, find the genuine clusters by repeatedly combining these sub-clusters using the cluster separation. The experiment shows the efficiency and scalability of this approach.
Keywords
data mining; hierarchical systems; pattern clustering; relational databases; cluster separation; data mining; multirelational hierarchical clustering; relational databases; shared nearest neighbor similarity; Clustering algorithms; Cybernetics; Data mining; Educational institutions; Machine learning; Machine learning algorithms; Nearest neighbor searches; Partitioning algorithms; Relational databases; Scalability; Data mining; Hierarchical clustering; Multi-relational clustering; Relational databases; Shared nearest neighbor;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location
Hong Kong
Print_ISBN
978-1-4244-0973-0
Electronic_ISBN
978-1-4244-0973-0
Type
conf
DOI
10.1109/ICMLC.2007.4370836
Filename
4370836
Link To Document