• DocumentCode
    1151690
  • Title

    Distorted Metrics on Trees and Phylogenetic Forests

  • Author

    Mossel, Elchanan

  • Author_Institution
    Dept. of Stat., California Univ., Berkeley, CA
  • Volume
    4
  • Issue
    1
  • fYear
    2007
  • Firstpage
    108
  • Lastpage
    116
  • Abstract
    We study distorted metrics on binary trees in the context of phylogenetic reconstruction. Given a binary tree T on n leaves with a path metric d, consider the pairwise distances {d(u,v)} between leaves. It is well known that these determine the tree and the d length of all edges. Here, we consider distortions d of d such that, for all leaves u and v, it holds that |d(u,v)-dmacr(u,v)|<f/2 if either d(u,v)<M+f/2 or d(u,v)<M+f/2, where d satisfies flesd(e)lesg for all edges e. Given such distortions, we show how to reconstruct in polynomial time a forest T1.....T0 such that the true tree T may be obtained from that forest by adding alpha-1 edges and alpha-1les2-Omega(M/g)n. Our distorted metric result implies a reconstruction algorithm of phylogenetic forests with a small number of trees from sequences of length logarithmic in the number of species. The reconstruction algorithm is applicable for the general Markov model. Both the distorted metric result and its applications to phylogeny are almost tight
  • Keywords
    Markov processes; biology computing; evolution (biological); polynomials; trees (mathematics); distorted metrics; general Markov model; pairwise distances; phylogenetic forests; phylogenetic reconstruction; phylogeny; polynomial time; trees; Binary trees; Biological system modeling; Character generation; Computational complexity; Helium; Phylogeny; Polynomials; Reconstruction algorithms; Sequences; Upper bound; CFN; Jukes-Cantor; Phylogenetics; distortion.; forest; metric; tree; Algorithms; Animals; Computational Biology; Genome; Humans; Models, Genetic; Mutation; Phylogeny;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2007.1010
  • Filename
    4104464