Title :
Distorted Metrics on Trees and Phylogenetic Forests
Author :
Mossel, Elchanan
Author_Institution :
Dept. of Stat., California Univ., Berkeley, CA
Abstract :
We study distorted metrics on binary trees in the context of phylogenetic reconstruction. Given a binary tree T on n leaves with a path metric d, consider the pairwise distances {d(u,v)} between leaves. It is well known that these determine the tree and the d length of all edges. Here, we consider distortions d of d such that, for all leaves u and v, it holds that |d(u,v)-dmacr(u,v)|<f/2 if either d(u,v)<M+f/2 or d(u,v)<M+f/2, where d satisfies flesd(e)lesg for all edges e. Given such distortions, we show how to reconstruct in polynomial time a forest T1.....T0 such that the true tree T may be obtained from that forest by adding alpha-1 edges and alpha-1les2-Omega(M/g)n. Our distorted metric result implies a reconstruction algorithm of phylogenetic forests with a small number of trees from sequences of length logarithmic in the number of species. The reconstruction algorithm is applicable for the general Markov model. Both the distorted metric result and its applications to phylogeny are almost tight
Keywords :
Markov processes; biology computing; evolution (biological); polynomials; trees (mathematics); distorted metrics; general Markov model; pairwise distances; phylogenetic forests; phylogenetic reconstruction; phylogeny; polynomial time; trees; Binary trees; Biological system modeling; Character generation; Computational complexity; Helium; Phylogeny; Polynomials; Reconstruction algorithms; Sequences; Upper bound; CFN; Jukes-Cantor; Phylogenetics; distortion.; forest; metric; tree; Algorithms; Animals; Computational Biology; Genome; Humans; Models, Genetic; Mutation; Phylogeny;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2007.1010