• DocumentCode
    1050414
  • Title

    A Unified Approach for Reconstructing Ancient Gene Clusters

  • Author

    Stoye, Jens ; Wittler, Roland

  • Author_Institution
    Genome Inf. Group, Bielefeld Univ., Bielefeld, Germany
  • Volume
    6
  • Issue
    3
  • fYear
    2009
  • Firstpage
    387
  • Lastpage
    400
  • Abstract
    The order of genes in genomes provides extensive information. In comparative genomics, differences or similarities of gene orders are determined to predict functional relations of genes or phylogenetic relations of genomes. For this purpose, various combinatorial models can be used to identify gene clusters-groups of genes that are colocated in a set of genomes. We introduce a unified approach to model gene clusters and define the problem of labeling the inner nodes of a given phylogenetic tree with sets of gene clusters. Our optimization criterion in this context combines two properties: parsimony, i.e., the number of gains and losses of gene clusters has to be minimal, and consistency, i.e., for each ancestral node, there must exist at least one potential gene order that contains all the reconstructed clusters. We present and evaluate an exact algorithm to solve this problem. Despite its exponential worst-case time complexity, our method is suitable even for large-scale data. We show the effectiveness and efficiency on both simulated and real data.
  • Keywords
    genetics; genomics; pattern clustering; statistical analysis; ancestral node; comparative genomics; exponential worst-case time complexity; gene cluster reconstruction; optimization criterion; phylogenetic tree; Comparative genomics; consistency.; gene cluster; gene cluster reconstruction; gene order; parsimony; phylogeny; Algorithms; Bacteria; Computer Simulation; Gene Order; Genome, Bacterial; Genomics; Models, Genetic; Multigene Family; Phylogeny;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2008.135
  • Filename
    4731235