Title :
An Ω(n^2/ log n) Speed-Up of TBR Heuristics for the Gene-Duplication Problem
Author :
Bansal, Mukul S. ; Eulenst, Oliver
Author_Institution :
Dept. of Comput. Sci., Iowa State Univ., Ames, IA
Abstract :
The gene-duplication problem is to infer a species supertree from gene trees that are confounded by complex histories of gene duplications. This problem is NP-complete and thus requires efficient and effective heuristics. Existing heuristics perform a stepwise search of the tree space, where each step is guided by an exact solution to an instance of a local search problem. We improve on the time complexity of the local search problem by a factor of n2/logn, where n is the size of the resulting species supertree. Typically, several thousand instances of the local search problem are solved throughout a stepwise heuristic search. Hence, our improvement makes the gene-duplication problem much more tractable for large-scale phylogenetic analyses.
Keywords :
biology computing; computational complexity; genetics; heuristic programming; trees (mathematics); NP-hard problem; TBR heuristics; gene-duplication problem; large-scale phylogenetic analysis; local search problem; stepwise heuristic search; supertrees; Algorithms; Computational Biology; Gene Duplication; Phylogenetics; Supertrees; Algorithms; Base Sequence; Chromosome Mapping; Gene Duplication; Molecular Sequence Data; Sequence Analysis, DNA;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2008.69