DocumentCode
1050414
Title
A Unified Approach for Reconstructing Ancient Gene Clusters
Author
Stoye, Jens ; Wittler, Roland
Author_Institution
Genome Inf. Group, Bielefeld Univ., Bielefeld, Germany
Volume
6
Issue
3
fYear
2009
Firstpage
387
Lastpage
400
Abstract
The order of genes in genomes provides extensive information. In comparative genomics, differences or similarities of gene orders are determined to predict functional relations of genes or phylogenetic relations of genomes. For this purpose, various combinatorial models can be used to identify gene clusters-groups of genes that are colocated in a set of genomes. We introduce a unified approach to model gene clusters and define the problem of labeling the inner nodes of a given phylogenetic tree with sets of gene clusters. Our optimization criterion in this context combines two properties: parsimony, i.e., the number of gains and losses of gene clusters has to be minimal, and consistency, i.e., for each ancestral node, there must exist at least one potential gene order that contains all the reconstructed clusters. We present and evaluate an exact algorithm to solve this problem. Despite its exponential worst-case time complexity, our method is suitable even for large-scale data. We show the effectiveness and efficiency on both simulated and real data.
Keywords
genetics; genomics; pattern clustering; statistical analysis; ancestral node; comparative genomics; exponential worst-case time complexity; gene cluster reconstruction; optimization criterion; phylogenetic tree; Comparative genomics; consistency.; gene cluster; gene cluster reconstruction; gene order; parsimony; phylogeny; Algorithms; Bacteria; Computer Simulation; Gene Order; Genome, Bacterial; Genomics; Models, Genetic; Multigene Family; Phylogeny;
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
ieee
ISSN
1545-5963
Type
jour
DOI
10.1109/TCBB.2008.135
Filename
4731235
Link To Document