DocumentCode
478742
Title
Clustering of Gene Expression Data: Performance and Similarity Analysis
Author
Yin, Longde ; Huang, Chun-Hsi
Author_Institution
Dept. of Comput. Sci. & Eng., Connecticut Univ., Storrs, CT
Volume
1
fYear
2006
fDate
20-24 June 2006
Firstpage
142
Lastpage
149
Abstract
Recent advances of the DNA microarray technology allow monitoring gene expression profiles of thousands of genes simultaneously. However, the analysis and handling of such fast growing data is becoming the major bottleneck in the utilization of the technology. Clustering analysis is one of the most effective methods for analyzing such gene expression data. In this paper we first experimentally study three major clustering algorithms: hierarchical clustering, self-organizing map (SOM), and self organizing tree algorithm (SOTA), using yeast saccharomyces cerevisiae gene expression data, and compare their performance. Then, we present a data mining tool, cluster diff, which allows the similarity analysis of clusters generated by different algorithms. A case study is conducted based on clusters generated by SOTA and SOM
Keywords
DNA; biology computing; data mining; genetics; molecular biophysics; pattern clustering; self-organising feature maps; DNA microarray technology; SOM; SOTA; clustering analysis; data mining tool; hierarchical clustering; self organizing tree algorithm; self-organizing map; yeast saccharomyces cerevisiae gene expression data; Algorithm design and analysis; Chemical technology; Clustering algorithms; Clustering methods; DNA; Data mining; Fungi; Gene expression; Organizing; Performance analysis; Cluster Similarity Analysis; Clustering algorithms; Gene expression; Microarray; Performance study;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Computational Sciences, 2006. IMSCCS '06. First International Multi-Symposiums on
Conference_Location
Hanzhou, Zhejiang
Print_ISBN
0-7695-2581-4
Type
conf
DOI
10.1109/IMSCCS.2006.43
Filename
4673538
Link To Document