• DocumentCode
    2048314
  • Title

    Phylogenetic models of rate heterogeneity: a high performance computing perspective

  • Author

    Stamatakis, Alexandros

  • Author_Institution
    Inst. of Comput. Sci., Found. for Res. & Technol.-Hellas, Heraklion
  • fYear
    2006
  • fDate
    25-29 April 2006
  • Abstract
    Inference of phylogenetic trees using the maximum likelihood (ML) method is NP-hard. Furthermore, the computation of the likelihood function for huge trees of more than 1,000 organisms is computationally intensive due to a large amount of floating point operations and high memory consumption. Within this context, the present paper compares two competing mathematical models that account for evolutionary rate heterogeneity: the Gamma and CAT models. The intention of this paper is to show that - from a purely empirical point of view - CAT can be used instead of Gamma. The main advantage of CAT over Gamma consists in significantly lower memory consumption and faster inference times. An experimental study using RAxML has been performed on 19 real-world datasets comprising 73 up to 1,663 DNA sequences. Results show that CAT is on average 5.5 times faster than Gamma and - surprisingly enough - also yields trees with slightly superior Gamma likelihood values. The usage of the CAT model decreases the amount of average L2 and L3 cache misses by factor 8.55
  • Keywords
    biocomputing; biology computing; computational complexity; genetics; maximum likelihood estimation; trees (mathematics); CAT models; DNA sequences; NP-hard problem; RAxML; floating point operations; high memory consumption; high performance computing perspective; maximum likelihood; phylogenetic trees; rate heterogeneity; Biology computing; Context modeling; DNA; High performance computing; Inference algorithms; Mathematical model; Organisms; Phylogeny; Sequences; Topology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2006. IPDPS 2006. 20th International
  • Conference_Location
    Rhodes Island
  • Print_ISBN
    1-4244-0054-6
  • Type

    conf

  • DOI
    10.1109/IPDPS.2006.1639535
  • Filename
    1639535