  • DocumentCode
    2889879
  • Title
    A Simple and Accurate Method for Rogue Taxon Identification
  • Author
    Aberer, Andre J. ; Stamatakis, Alexandros
  • Author_Institution
    Exelixis Lab., Heidelberg Inst. for Theor. Studies, Heidelberg, Germany
  • fYear
    2011
  • fDate
    12-15 Nov. 2011
  • Firstpage
    118
  • Lastpage
    122
  • Abstract
    The summary of a phylogenetic analysis (typically a consensus tree) can be substantially biased by so-called rogue taxa (or briefly: rogues). Rogues assume varying phylogenetic positions in the tree collection that is used to build the consensus tree and thereby decrease the resolution of the consensus. We present an accurate and straightforward algorithm for identifying rogues that removes one taxon at a time and assesses the effect of each removal on the consensus tree support values. Our approach improves the resolution of the consensus tree and, at the same time, increases the support values of existing relationships. We compare our algorithm to three competing methods (leaf stability index, taxonomic instability index, and Pattengale's algorithm) on a large number of real biological data sets. We show that it outperforms stability-based methods, since rogue taxa are not necessarily the most unstable taxa with respect to stability measures. Our algorithm is more memory-efficient than Pattengale's approach, while instances where Pattengale's algorithm outperforms our approach appear to be rare on real data. Finally, we find that it is advisable to conduct a de novo bootstrap analysis after rogues have been removed from the sequence alignment.
  • Keywords
    biological techniques; biology computing; Pattengale algorithm; consensus tree; de novo bootstrap analysis; leaf stability index; phylogenetic analysis; real biological data set; rogue taxa; rogue taxon identification; taxonomic instability index; tree support value; Algorithm design and analysis; Indexes; Large scale integration; Phylogeny; Stability criteria; Vegetation; phylogenetic post-analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2011 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • Print_ISBN
    978-1-4577-1799-4
  • Type
    conf
  • DOI
    10.1109/BIBM.2011.70
  • Filename
    6120419