• DocumentCode
    3322003
  • Title

    Hierarchical clustering of gene expression data

  • Author

    Luo, Feng ; Tang, Kun ; Khan, Latifur

  • Author_Institution
    Dept. of Comput. Sci., Texas Univ., Richardson, TX, USA
  • fYear
    2003
  • fDate
    10-12 March 2003
  • Firstpage
    328
  • Lastpage
    335
  • Abstract
    Rapid development of biological technologies generates a huge amount of data, which provides a processing and global view of the gene expression levels across different conditions and over multiple stages. Analyzation and interpretation of these massive data is a challenging task. One of the most important steps is to extract useful and rational fundamental patterns of gene expression inherent in these huge data. Clustering technology is one of the useful and popular methods to obtain these patterns. In this paper we propose a new hierarchical clustering algorithm to obtain gene expression patterns. This algorithm constructs a hierarchy from top to bottom based on a self-organizing tree. It dynamically finds the number of clusters at each level. We compare our algorithm with the traditional hierarchical agglomerative clustering (HAC) algorithm. We apply our algorithm to an existing 112 rat central nervous system gene expression data. We observe that our algorithm extracts patterns with different levels of abstraction. Furthermore, our approach is useful on recognizing features in complex gene expression data.
  • Keywords
    biology computing; genetics; neurophysiology; trees (mathematics); biological technologies development; complex gene expression data; features recognition; gene expression data; gene expression levels; hierarchical clustering; hierarchy construction; patterns extraction; rat central nervous system; self-organizing tree; traditional hierarchical agglomerative clustering algorithm; useful rational fundamental patterns; Algorithm design and analysis; Biology; Biomedical measurements; Central nervous system; Clustering algorithms; Computer science; Data analysis; Data mining; Gene expression; Neural networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering, 2003. Proceedings. Third IEEE Symposium on
  • Print_ISBN
    0-7695-1907-5
  • Type

    conf

  • DOI
    10.1109/BIBE.2003.1188970
  • Filename
    1188970