DocumentCode
3322003
Title
Hierarchical clustering of gene expression data
Author
Luo, Feng ; Tang, Kun ; Khan, Latifur
Author_Institution
Dept. of Comput. Sci., Texas Univ., Richardson, TX, USA
fYear
2003
fDate
10-12 March 2003
Firstpage
328
Lastpage
335
Abstract
Rapid development of biological technologies generates a huge amount of data, which provides a processing and global view of the gene expression levels across different conditions and over multiple stages. Analyzation and interpretation of these massive data is a challenging task. One of the most important steps is to extract useful and rational fundamental patterns of gene expression inherent in these huge data. Clustering technology is one of the useful and popular methods to obtain these patterns. In this paper we propose a new hierarchical clustering algorithm to obtain gene expression patterns. This algorithm constructs a hierarchy from top to bottom based on a self-organizing tree. It dynamically finds the number of clusters at each level. We compare our algorithm with the traditional hierarchical agglomerative clustering (HAC) algorithm. We apply our algorithm to an existing 112 rat central nervous system gene expression data. We observe that our algorithm extracts patterns with different levels of abstraction. Furthermore, our approach is useful on recognizing features in complex gene expression data.
Keywords
biology computing; genetics; neurophysiology; trees (mathematics); biological technologies development; complex gene expression data; features recognition; gene expression data; gene expression levels; hierarchical clustering; hierarchy construction; patterns extraction; rat central nervous system; self-organizing tree; traditional hierarchical agglomerative clustering algorithm; useful rational fundamental patterns; Algorithm design and analysis; Biology; Biomedical measurements; Central nervous system; Clustering algorithms; Computer science; Data analysis; Data mining; Gene expression; Neural networks;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Bioengineering, 2003. Proceedings. Third IEEE Symposium on
Print_ISBN
0-7695-1907-5
Type
conf
DOI
10.1109/BIBE.2003.1188970
Filename
1188970
Link To Document