DocumentCode :
1016417
Title :
Assessing the Discordance of Multiple Sequence Alignments
Author :
Prakash, Amol ; Tompa, Martin
Author_Institution :
Thermo, Biomarker Res. Initiative in Mass Spectrometry (BRIMS) Center, Cambridge, MA, USA
Volume :
6
Issue :
4
fYear :
2009
Firstpage :
542
Lastpage :
551
Abstract :
Multiple sequence alignments have wide applicability in many areas of computational biology, including comparative genomics, functional annotation of proteins, gene finding, and modeling evolutionary processes. Because of the computational difficulty of multiple sequence alignment and the availability of numerous tools, it is critical to be able to assess the reliability of multiple alignments. We present a tool called StatSigMA to assess whether multiple alignments of nucleotide or amino acid sequences are contaminated with one or more unrelated sequences. There are numerous applications for which StatSigMA can be used. Two such applications are to distinguish homologous sequences from nonhomologous ones and to compare alignments produced by various multiple alignment tools. We present examples of both types of applications.
Keywords :
mathematical analysis; molecular biophysics; StatSigMA tool; amino acid sequence; homologous sequences; multiple sequence alignments; nonhomologous sequences; nucleotide sequence; Biology and genetics; Karlin-Altschul statistics; Life and Medical Sciences; Multiple sequence alignment; alignment accuracy; biology and genetics; computer applications.; discordance; life and medical sciences; Algorithms; Animals; Computational Biology; Computers; Databases, Genetic; Databases, Protein; Genomics; Humans; Models, Statistical; Reproducibility of Results; Sequence Alignment; Sequence Analysis; Sequence Analysis, DNA; Sequence Analysis, Protein; Software;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2007.70271
Filename :
4407682
Link To Document :
بازگشت