Title :
DICON: Interactive Visual Analysis of Multidimensional Clusters
Author :
Cao, Nan ; Gotz, David ; Sun, Jimeng ; Qu, Huamin
Author_Institution :
Dept. of Comput. Sci. & Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
Abstract :
Clustering as a fundamental data analysis technique has been widely used in many analytic applications. However, it is often difficult for users to understand and evaluate multidimensional clustering results, especially the quality of clusters and their semantics. For large and complex data, high-level statistical information about the clusters is often needed for users to evaluate cluster quality while a detailed display of multidimensional attributes of the data is necessary to understand the meaning of clusters. In this paper, we introduce DICON, an icon-based cluster visualization that embeds statistical information into a multi-attribute display to facilitate cluster interpretation, evaluation, and comparison. We design a treemap-like icon to represent a multidimensional cluster, and the quality of the cluster can be conveniently evaluated with the embedded statistical information. We further develop a novel layout algorithm which can generate similar icons for similar clusters, making comparisons of clusters easier. User interaction and clutter reduction are integrated into the system to help users more effectively analyze and refine clustering results for large datasets. We demonstrate the power of DICON through a user study and a case study in the healthcare domain. Our evaluation shows the benefits of the technique, especially in support of complex multidimensional cluster analysis.
Keywords :
data analysis; data structures; data visualisation; embedded systems; interactive systems; pattern clustering; statistical distributions; DICON; cluster quality; clutter reduction; complex data; fundamental data analysis technique; high-level statistical information; icon-based cluster visualization; interactive visual analysis; layout algorithm; multidimensional attribute display; multidimensional cluster; statistical information; user interaction; Algorithm design and analysis; Clustering algorithms; Encoding; Image color analysis; Information analysis; Visualization; Clustering; Information Visualization.; Visual Analysis; Algorithms; Cluster Analysis; Computer Graphics; Data Interpretation, Statistical; Databases, Factual; Humans; User-Computer Interface;
Journal_Title :
Visualization and Computer Graphics, IEEE Transactions on
DOI :
10.1109/TVCG.2011.188