DocumentCode
1754962
Title
A Knowledge-Based Multiple-Sequence Alignment Algorithm
Author
Nguyen, Khoa D. ; Yi Pan
Author_Institution
Dept. of Comput. Sci. & Inf. Technol., Clayton State Univ., Morrow, GA, USA
Volume
10
Issue
4
fYear
2013
fDate
July-Aug. 2013
Firstpage
884
Lastpage
896
Abstract
A common and cost-effective mechanism to identify the functionalities, structures, or relationships between species is multiple-sequence alignment, in which DNA/RNA/protein sequences are arranged and aligned so that similarities between sequences are clustered together. Correctly identifying and aligning these sequence biological similarities help from unwinding the mystery of species evolution to drug design. We present our knowledge-based multiple sequence alignment (KB-MSA) technique that utilizes the existing knowledge databases such as SWISSPROT, GENBANK, or HOMSTRAD to provide a more realistic and reliable sequence alignment. We also provide a modified version of this algorithm (CB-MSA) that utilizes the sequence consistency information when sequence knowledge databases are not available. Our benchmark tests on BAliBASE, PREFAB, HOMSTRAD, and SABMARK references show accuracy improvements up to 10 percent on twilight data sets against many leading alignment tools such as ISPALIGN, PADT, CLUSTALW, MAFFT, PROBCONS, and T-COFFEE.
Keywords
DNA; RNA; benchmark testing; bioinformatics; database management systems; drugs; evolution (biological); knowledge based systems; molecular biophysics; molecular configurations; proteins; BAliBASE; CB-MSA algorithm; CLUSTALW; DNA-RNA-protein sequences; GENBANK; HOMSTRAD; ISPALIGN; KB-MSA technique; MAFFT; PADT; PREFAB; PROBCONS; SABMARK; SWISSPROT; T-COFFEE; benchmark tests; drug design; knowledge databases; knowledge-based multiple-sequence alignment algorithm; species evolution; Amino acids; Bioinformatics; Computational biology; Databases; Knowledge based systems; Phylogeny; Bioinformatics; consistency MSA; knowledge-based MSA; multiple-sequence alignment; progressive MSA;
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
ieee
ISSN
1545-5963
Type
jour
DOI
10.1109/TCBB.2013.102
Filename
6583157
Link To Document