DocumentCode
2889879
Title
A Simple and Accurate Method for Rogue Taxon Identification
Author
Aberer, Andre J. ; Stamatakis, Alexandros
Author_Institution
Exelixis Lab, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
fYear
2011
fDate
12-15 Nov. 2011
Firstpage
118
Lastpage
122
Abstract
The summary of a phylogenetic analysis (typically a consensus tree) can be substantially biased by so-called rogue taxa (or briefly: rogues). Rogues assume varying phylogenetic positions in the tree collection that is used to build the consensus tree and thereby decrease the resolution of the consensus. We present an accurate and straightforward algorithm for identifying rogues that assesses the effect on the consensus tree support values of removing one taxon at a time. Our approach improves the resolution of the consensus tree and, at the same time, increases the support values of existing relationships. We compare our algorithm to three competing methods (leaf stability index, taxonomic instability index, and Pattengale's algorithm) on a large number of real biological data sets. We show that it outperforms stability-based methods, since rogue taxa are not necessarily the most unstable taxa with respect to stability measures. Our algorithm is more memory-efficient than Pattengale's approach, and instances in which Pattengale's algorithm outperforms ours appear to be rare on real data. Finally, we find that it is advisable to conduct a de novo bootstrap analysis after rogues have been removed from the sequence alignment.
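The idea of removing one taxon at a time and checking the effect on consensus support can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes rooted trees given as sets of clades (the paper itself may work with unrooted bipartitions), a majority-rule threshold of 0.5, and the summed support of consensus clades as the score. The names `prune`, `consensus_support`, and `find_rogues` are hypothetical, as is the toy data.

```python
from collections import Counter


def prune(tree, dropped, all_taxa):
    """Remove dropped taxa from every clade and discard clades that become trivial."""
    remaining = len(all_taxa - dropped)
    pruned = set()
    for clade in tree:
        reduced = clade - dropped
        # Keep only informative clades: at least two taxa, but not all of them.
        if 2 <= len(reduced) < remaining:
            pruned.add(reduced)
    return pruned


def consensus_support(trees, dropped, all_taxa, threshold=0.5):
    """Sum the support of all clades that would enter the majority-rule consensus."""
    counts = Counter()
    for tree in trees:
        counts.update(prune(tree, dropped, all_taxa))
    n = len(trees)
    return sum(c / n for c in counts.values() if c / n > threshold)


def find_rogues(trees, all_taxa):
    """Greedily drop the single taxon whose removal raises the score most (assumed criterion)."""
    dropped = frozenset()
    best = consensus_support(trees, dropped, all_taxa)
    while len(all_taxa - dropped) > 3:
        candidates = {
            taxon: consensus_support(trees, dropped | {taxon}, all_taxa)
            for taxon in sorted(all_taxa - dropped)
        }
        taxon = max(candidates, key=candidates.get)
        if candidates[taxon] <= best:
            break  # no single removal improves the consensus any further
        dropped |= {taxon}
        best = candidates[taxon]
    return sorted(dropped), best


# Toy collection: (A,B) and (C,D) are stable, while X changes position in every tree.
taxa = frozenset("ABCDX")
trees = [
    {frozenset("AB"), frozenset("ABX"), frozenset("CD")},
    {frozenset("AB"), frozenset("CD"), frozenset("CDX")},
    {frozenset("AX"), frozenset("ABX"), frozenset("CD")},
    {frozenset("AB"), frozenset("CX"), frozenset("CDX")},
]
print(find_rogues(trees, taxa))
```

On this toy collection the sketch reports X as the only rogue: the summed support rises from 1.5 with all taxa to 2.0 once X is pruned, because the clades (A,B) and (C,D) then appear in every tree.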
Keywords
biological techniques; biology computing; Pattengale algorithm; consensus tree; de novo bootstrap analysis; leaf stability index; phylogenetic analysis; real biological data set; rogue taxa; rogue taxon identification; taxonomic instability index; tree support value; Algorithm design and analysis; Indexes; Large scale integration; Phylogeny; Stability criteria; Vegetation; phylogenetic post-analysis;
fLanguage
English
Publisher
IEEE
Conference_Titel
Bioinformatics and Biomedicine (BIBM), 2011 IEEE International Conference on
Conference_Location
Atlanta, GA
Print_ISBN
978-1-4577-1799-4
Type
conf
DOI
10.1109/BIBM.2011.70
Filename
6120419