DocumentCode :
3324657
Title :
High performance phylogenetic inference
Author :
Clement, Mark ; Snell, Quinn ; Judd, Glenn ; Whiting, Michael
Author_Institution :
Brigham Young Univ., Provo, UT, USA
fYear :
1999
fDate :
1999
Firstpage :
335
Lastpage :
336
Abstract :
Phylogenetic analysis is an integral part of many biological research programs. In essence, it is the study of gene genealogy. It is the study of gene mutation and the generational relationships. Phylogenetic analysis is being used in many diverse areas such as human epidemiology, viral transmission, biogeography, and systematics. Researchers are now commonly generating many DNA sequences from many individuals, thus creating very large data sets. However, our ability to analyze the data has not kept pace with data generation, and phylogenetics has now reached a crossroads where we cannot effectively analyze the data we generate. The chief challenge of phylogenetic systematics in the next century will be to develop algorithms and search strategies to effectively analyze large data sets. The crux of the computational problem is that the actual landscape of possible topologies can be extraordinarily difficult to evaluate with large data sets. The parsimony ratchet is actually a family of iterative tree search methods that use a statistical approach to sampling tree islands and ultimately finding the most parsimonious trees for a data set. Each iteration of the parsimony ratchet may occur in parallel as there is no direct dependency between iterations. The authors´ implementation of the parallel ratchet is masterworker based. A master process is launched in the DOGMA system which then launches worker tasks on available nodes. Each worker task is simply wrapper code that is used to interact with the newest release version of NONA
Keywords :
biology computing; distributed object management; genetics; parallel programming; statistical analysis; tree searching; DNA sequences; DOGMA system; NONA; biogeography; biological research programs; gene genealogy; gene mutation; generational relationships; high performance phylogenetic inference; human epidemiology; iterative tree search methods; master process; masterworker based; parallel ratchet; parsimonious trees; parsimony ratchet; phylogenetic analysis; phylogenetic systematics; search strategies; statistical approach; topologies; tree island sampling; very large data sets; viral transmission; worker tasks; wrapper code; Biogeography; Biological information theory; DNA; Data analysis; Genetic mutations; Humans; Iterative algorithms; Phylogeny; Sequences; Systematics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Distributed Computing, 1999. Proceedings. The Eighth International Symposium on
Conference_Location :
Redondo Beach, CA
ISSN :
1082-8907
Print_ISBN :
0-7803-5681-0
Type :
conf
DOI :
10.1109/HPDC.1999.805315
Filename :
805315
Link To Document :
بازگشت