DocumentCode
2530891
Title
A Divide-and-Conquer Implementation of Three Sequence Alignment and Ancestor Inference
Author
Yue, Feng ; Tang, Jijun
fYear
2007
fDate
2-4 Nov. 2007
Firstpage
143
Lastpage
150
Abstract
In this paper, we present an algorithm to simultaneously align three biological sequences with affine gap model and infer their common ancestral sequence. Our algorithm can be further extended to perform tree alignment for more se- quences, and eventually unify the two procedures of phylo- genetic reconstruction and sequence alignment. The nov- elty of our algorithm is: it applies the divide-and-conquer strategy so that the memory usage is reduced from O (n3) to O (n2), while at the same time, it is based on dynamic programming and optimal alignment is guaranteed. Tra- ditionally, three sequence alignment is limited by the huge demand of memory space and can only handle sequences less than two hundred characters long. With the new im- proved algorithm, we can produce the optimal alignment of sequences of several thousand characters long. We implemented our algorithm as a C program package MSAM . It has been extensively tested with BAliBASE, a real manually refined multiple sequence alignment database, as well as simulated datasets generated by Rose (Ran- dom Model of Sequence Evolution). We compared our re- sults with those of other popular multiple sequence align- ment tools, including the widely used programs such as ClustalW and T-Coffee. The experiment shows that MSAM produces not only better alignment, but also better ancestral sequence. The software can be downloaded for free at http://www.cse.sc.edu/phylo/MSAM.html
Keywords
Binary trees; Biological system modeling; Biomedical engineering; Computer science; Costs; Evolution (biology); Genetic mutations; Phylogeny; Sequences; USA Councils;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine, 2007. BIBM 2007. IEEE International Conference on
Conference_Location
Fremont, CA
Print_ISBN
978-0-7695-3031-4
Type
conf
DOI
10.1109/BIBM.2007.40
Filename
4413048
Link To Document