• DocumentCode
    3321914
  • Title

    Analyzing the Escherichia coli gene expression data by a multilayer adjusted tree organizing map

  • Author

    Wei, Ning ; Gruenwald, Le ; Conway, Tyrrell

  • Author_Institution
    Sch. of Comput. Sci., Oklahoma Univ., USA
  • fYear
    2003
  • fDate
    10-12 March 2003
  • Firstpage
    289
  • Lastpage
    296
  • Abstract
    Using the DNA microarray technology, biologists have thousands of array data available. Discovering the function relations between genes and their involvements in biological processes depends on the ability to efficiently process and quantitatively analyze large amounts of array data. Clustering algorithms are among the popular tools that can be used to help biologists achieve their goals. Although some existing research projects employed clustering algorithms on biological data, none of them has examined the Escherichia coli (E. coli) gene expression data. This paper proposes a clustering algorithm called Multilayer Adjusted Tree Organizing Map (MATOM) to analyze the E. coli gene expression data. In a semi-supervised manner, MATOM constructs a multilayer map, and at the same time, removes noise data in the previously trained maps in order to improve the training process. This paper then presents the clustering results produced by MATOM and other existing clustering algorithms using the E. coli gene expression data, and a new evaluation method to assess them. The results show that MATOM performs the best in terms of percentage of genes that are clustered correctly.
  • Keywords
    DNA; arrays; biological techniques; biology computing; genetics; microorganisms; noise; trees (mathematics); Escherichia coli gene expression data analysis; MATOM; correctly clustered genes percentage; evaluation method; functional genomics; multilayer adjusted tree organizing map; noise data removal; semi-supervised manner; Algorithm design and analysis; Bioinformatics; Biological processes; Clustering algorithms; Computer science; DNA; Gene expression; Genomics; Nonhomogeneous media; Organizing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Bioinformatics and Bioengineering, 2003. Proceedings. Third IEEE Symposium on
  • Print_ISBN
    0-7695-1907-5
  • Type

    conf

  • DOI
    10.1109/BIBE.2003.1188965
  • Filename
    1188965