DocumentCode
3321914
Title
Analyzing the Escherichia coli gene expression data by a multilayer adjusted tree organizing map
Author
Wei, Ning ; Gruenwald, Le ; Conway, Tyrrell
Author_Institution
Sch. of Comput. Sci., Oklahoma Univ., USA
fYear
2003
fDate
10-12 March 2003
Firstpage
289
Lastpage
296
Abstract
Using DNA microarray technology, biologists have thousands of array data sets available. Discovering the functional relations between genes and their involvement in biological processes depends on the ability to efficiently process and quantitatively analyze large amounts of array data. Clustering algorithms are among the popular tools that can help biologists achieve these goals. Although some existing research projects have applied clustering algorithms to biological data, none of them has examined the Escherichia coli (E. coli) gene expression data. This paper proposes a clustering algorithm called Multilayer Adjusted Tree Organizing Map (MATOM) to analyze the E. coli gene expression data. In a semi-supervised manner, MATOM constructs a multilayer map and, at the same time, removes noise data from the previously trained maps in order to improve the training process. The paper then presents the clustering results produced by MATOM and by other existing clustering algorithms on the E. coli gene expression data, along with a new evaluation method to assess them. The results show that MATOM performs best in terms of the percentage of genes that are clustered correctly.
Keywords
DNA; arrays; biological techniques; biology computing; genetics; microorganisms; noise; trees (mathematics); Escherichia coli gene expression data analysis; MATOM; correctly clustered genes percentage; evaluation method; functional genomics; multilayer adjusted tree organizing map; noise data removal; semi-supervised manner; Algorithm design and analysis; Bioinformatics; Biological processes; Clustering algorithms; Computer science; Gene expression; Genomics; Nonhomogeneous media; Organizing
fLanguage
English
Publisher
IEEE
Conference_Titel
Bioinformatics and Bioengineering, 2003. Proceedings. Third IEEE Symposium on
Print_ISBN
0-7695-1907-5
Type
conf
DOI
10.1109/BIBE.2003.1188965
Filename
1188965