Title :
EGGS: Extraction of Gene Clusters Using Genome Context Based Sequence Matching Techniques
Author :
Kim, Sun ; Bhan, Ankita ; Maryada, Bharath K. ; Choi, Kwangmin ; Brun, Yves V.
Author_Institution :
Indiana Univ., Fort Wayne
Abstract :
Functionally related genes co-evolve, probably due to selection pressures during evolution, This phenomenon leads to conservation of gene clusters across genomes, especially in microbial genomes. In this paper, we propose novel iterative constraint relaxation algorithms which make use of genome contexts to effectively remove noise and extract gene clusters: PairEGGS that generates gene clusters in a pair of genomes and MultiEGGS that combines gene clusters from genome pairs. Experiments showed that PairEGGS produced significantly larger gene clusters than existing algorithms, say FISH, and MultiEGGS was able to find gene clusters as large as of 118 genes that are common to three genomes. Both PairEGGS and MultiEGGS run fast enough to provide service on the web.
Keywords :
biology computing; genetics; pattern clustering; FISH; MultiEGGS; PairEGGS; Web service; gene cluster extraction; genome context based sequence matching; iterative constraint relaxation algorithms; microbial genomes; related genes; Bioinformatics; Clustering algorithms; Data mining; Evolution (biology); Genomics; Iterative algorithms; Marine animals; Sequences; Sun; USA Councils;
Conference_Titel :
Bioinformatics and Biomedicine, 2007. BIBM 2007. IEEE International Conference on
Conference_Location :
Fremont, CA
Print_ISBN :
978-0-7695-3031-4
DOI :
10.1109/BIBM.2007.61