• DocumentCode
    1990795
  • Title

    A Heuristic for Phylogenetic Reconstruction Using Transposition

  • Author

    Yue, Feng ; Zhang, Meng ; Tang, Jijun

  • Author_Institution
    Univ. of South, Columbia
  • fYear
    2007
  • fDate
    14-17 Oct. 2007
  • Firstpage
    802
  • Lastpage
    808
  • Abstract
    Because of the advent of high-throughput sequencing and the consequent reduction in cost of sequencing, many organisms have been completely sequenced and most of their genes identified; homologies among these genes are also getting established. It thus has become possible to represent whole genomes as ordered lists of gene identifiers and to study the evolution of these entities through computational means, in systematics as well as in comparative genomics. As a result, gene order data (also known as genome rearrangement data) has attracted increasing attention from both biologists and computer scientists as a new type of data for phylogenetic analysis. Methods for reconstructing phylogeny from genome rearrangements include distance-based methods, MCMC methods and direct optimization methods. The latter, pioneered by Sankoff and extended in the software packages of grappa and MGR, is the most accurate approach for inversion phylogeny. However, due to the difficulty of computing the transposition distance, this type of methods has not been applied to datasets where transposition is the only or dominant event. In this paper, we present a heuristic transposition median solver and extend grappa to handle transpositions. Our extensive testing using simulated datasets shows that this method (GRAPPA-TP) is very accurate in terms of ancestor genome inference and phylogenetic reconstruction. It also suggests that model match is critical in phylogenetic analysis, and a fast and accurate method for transposition distance computation is still very important. The new GRAPPA-TP is available from phylo.cse.sc.edu.
  • Keywords
    biology computing; genetics; heuristic programming; ancestor genome inference; extend GRAPPA; heuristic transposition median solver; high-throughput sequencing; phylogenetic reconstruction; Bioinformatics; Biology computing; Costs; Data analysis; Evolution (biology); Genomics; Optimization methods; Organisms; Phylogeny; Systematics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
  • Conference_Location
    Boston, MA
  • Print_ISBN
    978-1-4244-1509-0
  • Type

    conf

  • DOI
    10.1109/BIBE.2007.4375652
  • Filename
    4375652