DocumentCode :
1442278
Title :
A Consensus Tree Approach for Reconstructing Human Evolutionary History and Detecting Population Substructure
Author :
Tsai, Ming-Chi ; Blelloch, Guy ; Ravi, R. ; Schwartz, Russell
Author_Institution :
Lane Center for Comput. Biol., Joint CMU-Univ. of Pittsburgh Program in Comput. Biol., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
8
Issue :
4
fYear :
2011
Firstpage :
918
Lastpage :
928
Abstract :
The random accumulation of variations in the human genome over time implicitly encodes a history of how human populations have arisen, dispersed, and intermixed since we emerged as a species. Reconstructing that history is a challenging computational and statistical problem but has important applications both to basic research and to the discovery of genotype-phenotype correlations. We present a novel approach to inferring human evolutionary history from genetic variation data. We use the idea of consensus trees, a technique generally used to reconcile species trees from divergent gene trees, adapting it to the problem of finding robust relationships within a set of intraspecies phylogenies derived from local regions of the genome. Validation on both simulated and real data shows the method to be effective in recapitulating known true structure of the data closely matching our best current understanding of human evolutionary history. Additional comparison with results of leading methods for the problem of population substructure assignment verifies that our method provides comparable accuracy in identifying meaningful population subgroups in addition to inferring relationships among them. The consensus tree approach thus provides a promising new model for the robust inference of substructure and ancestry from large-scale genetic variation data.
Keywords :
biology computing; evolution (biological); genetics; genomics; ancestry; consensus tree approach; genetic variation data; genotype-phenotype correlations; human evolutionary history reconstruction; human genome; human populations; intraspecies phylogenies; population substructure detection; random accumulation; Bioinformatics; Computational modeling; Data models; Genetics; History; Humans; Phylogeny; Biology and genetics; graph algorithms.; information theory; trees; Algorithms; Cluster Analysis; Computational Biology; Computer Simulation; Evolution, Molecular; Genetics, Population; Genome, Human; Humans; Information Theory; Models, Genetic; Models, Statistical; Phylogeny; Reproducibility of Results;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2011.23
Filename :
5708136
Link To Document :
بازگشت