Title :
Exact Computation of Coalescent Likelihood for Panmictic and Subdivided Populations under the Infinite Sites Model
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Connecticut, Storrs, CT, USA
Abstract :
Coalescent likelihood is the probability of observing the given population sequences under the coalescent model. Computation of coalescent likelihood under the infinite sites model is a classic problem in coalescent theory. Existing methods are based on either importance sampling or Markov chain Monte Carlo and are inexact. In this paper, we develop a simple method that can compute the exact coalescent likelihood for many data sets of moderate size, including real biological data whose likelihood was previously thought to be difficult to compute exactly. Our method works for both panmictic and subdivided populations. Simulations demonstrate that the practical range of exact coalescent likelihood computation for panmictic populations is significantly larger than what was previously believed. We investigate the application of our method in estimating mutation rates by maximum likelihood. A main application of the exact method is comparing the accuracy of approximate methods. To demonstrate the usefulness of the exact method, we evaluate the accuracy of program Genetree in computing the likelihood for subdivided populations.
Keywords :
bioinformatics; biology computing; genetics; molecular biophysics; coalescent likelihood exact computation; exact coalescent likelihood computation; infinite sites model; mutation rates; panmictic population; program Genetree; subdivided population; Biological system modeling; Biology computing; Computational modeling; Computer science; Data analysis; Genetic mutations; Maximum likelihood estimation; Monte Carlo methods; Stochastic processes; Throughput; Population genetics; algorithms; coalescent theory; subdivided population.; Computer Simulation; Genetics, Population; Haplotypes; Likelihood Functions; Markov Chains; Monte Carlo Method; Mutation; Phylogeny; Probability;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2010.2