• DocumentCode
    3848552
  • Title

    Identifiability of Two-Tree Mixtures for Group-Based Models

  • Author

    Elizabeth S. Allman;Sonia Petrovic;John A. Rhodes;Seth Sullivant

  • Author_Institution
    University of Alaska, Fairbanks
  • Volume
    8
  • Issue
    3
  • fYear
    2011
  • Firstpage
    710
  • Lastpage
    722
  • Abstract
    Phylogenetic data arising on two possibly different tree topologies might be mixed through several biological mechanisms, including incomplete lineage sorting or horizontal gene transfer in the case of different topologies, or simply different substitution processes on characters in the case of the same topology. Recent work on a 2-state symmetric model of character change showed that for 4 taxa, such a mixture model has nonidentifiable parameters, and thus, it is theoretically impossible to determine the two tree topologies from any amount of data under such circumstances. Here, the question of identifiability is investigated for two-tree mixtures of the 4-state group-based models, which are more relevant to DNA sequence data. Using algebraic techniques, we show that the tree parameters are identifiable for the JC and K2P models. We also prove that generic substitution parameters for the JC mixture models are identifiable, and for the K2P and K3P models obtain generic identifiability results for mixtures on the same tree. This indicates that the full phylogenetic signal remains in such mixtures, and the 2-state symmetric result is thus a misleading guide to the behavior of other models.
  • Keywords
    "Computational modeling","Phylogeny","Biological system modeling","Markov processes","Polynomials","Data models","Analytical models"
  • Journal_Title
    IEEE/ACM Transactions on Computational Biology and Bioinformatics
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2010.79
  • Filename
    5557853