Title :
On Exploring Complex Relationships of Correlation Clusters
Author :
Achtert, Elke ; Böhm, Christian ; Kriegel, Hans-Peter ; Kröger, Peer ; Zimek, Arthur
Author_Institution :
Ludwig-Maximilians-Univ., Munich
Abstract :
In high dimensional data, clusters often only exist in arbitrarily oriented subspaces of the feature space. In addition, these so-called correlation clusters may have complex relationships between each other. For example, a correlation cluster in a 1-D subspace (forming a line) may be enclosed within one or even several correlation clusters in 2-D superspaces (forming planes). In general, such relationships can be seen as a complex hierarchy that allows multiple inclusions, i.e. clusters may be embedded in several super-clusters rather than only in one. Obviously, uncovering the hierarchical relationships between the detected correlation clusters is an important information gain. Since existing approaches cannot detect such complex hierarchical relationships among correlation clusters, we propose the algorithm ERiC to tackle this problem and to visualize the result by means of a graph-based representation. In our experimental evaluation, we show that ERiC finds more information than state-of-the-art correlation clustering methods and outperforms existing competitors in terms of efficiency.
Keywords :
data mining; data visualisation; graph theory; pattern clustering; 2D superspaces; ERiC algorithm; cluster hierarchy visualization; correlation cluster complex relationships; data mining task; graph-based representation; high dimensional data; Blood; Clustering algorithms; Clustering methods; Data visualization; Databases; Diseases; Gene expression; Genetics; Informatics; Principal component analysis;
Conference_Titel :
Scientific and Statistical Database Management, 2007. SSBDM '07. 19th International Conference on
Conference_Location :
Banff, Alta.
Print_ISBN :
0-7695-2868-6
Electronic_ISBN :
1551-6393
DOI :
10.1109/SSDBM.2007.21