DocumentCode :
2121289
Title :
Mining Hierarchies of Correlation Clusters
Author :
Achtert, Elke ; Böhm, Christian ; Kröger, Peer ; Zimek, Arthur
Author_Institution :
Inst. for Comput. Sci., Univ. of Munich
fYear :
0
fDate :
0-0 0
Firstpage :
119
Lastpage :
128
Abstract :
The detection of correlations between different features in high dimensional data sets is a very important data mining task. These correlations can be arbitrarily complex: one or more features might be correlated with several other features, and both noise features as well as the actual dependencies may be different for different clusters. Therefore, each cluster contains points that are located on a common hyperplane of arbitrary dimensionality in the data space and thus generates a separate, arbitrarily oriented subspace of the original data space. The few recently proposed algorithms designed to uncover these correlation clusters have several disadvantages. In particular, these methods cannot detect correlation clusters of different dimensionality which are nested into each other. The complete hierarchical structure of correlation clusters of varying dimensionality can only be detected by a hierarchical clustering approach. Therefore, we propose the algorithm HiCO (hierarchical correlation ordering), the first hierarchical approach to correlation clustering. The algorithm determines the cluster hierarchy, and visualizes it using correlation diagrams. Several comparative experiments using synthetic and real data sets show the performance and the effectivity of HiCO
Keywords :
data mining; data structures; pattern clustering; principal component analysis; very large databases; correlation cluster; correlation diagram; data mining; hierarchical correlation ordering; high dimensional data set; Algorithm design and analysis; Clustering algorithms; Computer science; Data mining; Data visualization; Databases; Gene expression; Linear approximation; Principal component analysis; Systems biology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Scientific and Statistical Database Management, 2006. 18th International Conference on
Conference_Location :
Vienna
ISSN :
1551-6393
Print_ISBN :
0-7695-2590-3
Type :
conf
DOI :
10.1109/SSDBM.2006.35
Filename :
1644305
Link To Document :
بازگشت