Title :
Heat Map Visualizations Allow Comparison of Multiple Clustering Results and Evaluation of Dataset Quality: Application to Microarray Data
Author :
Sharko, John ; Grinstein, Georges G. ; Marx, Kenneth A. ; Zhou, Jianping ; Cheng, Chia-Ho ; Odelberg, Shannon ; Simon, Hans-Georg
Author_Institution :
Univ. of Massachusetts, Lowell
Abstract :
Since clustering algorithms are heuristic, multiple clustering algorithms applied to the same dataset will typically not generate the same sets of clusters. This is especially true for complex datasets such as those from microarray time series experiments. Two such microarray datasets describing gene expression activities from regenerating newt forelimbs at various times following limb amputation were used in this study. A cluster stability matrix, which shows the number of times two genes appear in the same cluster, was generated as a heat map. This was used to evaluate the overall variation among the clustering algorithms and to identify similar clusters. A comparison of the cluster stability matrices for two related microarray experiments with different levels of precision was shown to be an effective basis for comparing the quality of the two sets of experiments. A pairwise heat map was generated to show which pairs of clustering algorithms grouped the data into similar clusters.
Keywords :
biology computing; data analysis; data visualisation; genetics; pattern clustering; stability; statistical databases; cluster stability matrix; clustering algorithms; dataset quality; gene expression activities; heat map visualizations; microarray data; microarray time series experiments; Algorithm design and analysis; Application software; Cities and towns; Clustering algorithms; Data visualization; Gene expression; Heuristic algorithms; Pediatrics; RNA; Stability;
Conference_Titel :
Information Visualization, 2007. IV '07. 11th International Conference
Conference_Location :
Zurich
Print_ISBN :
0-7695-2900-3