Title :
Assessing the Discordance of Multiple Sequence Alignments
Author :
Prakash, Amol ; Tompa, Martin
Author_Institution :
Thermo, Biomarker Res. Initiative in Mass Spectrometry (BRIMS) Center, Cambridge, MA, USA
Abstract :
Multiple sequence alignments have wide applicability in many areas of computational biology, including comparative genomics, functional annotation of proteins, gene finding, and modeling evolutionary processes. Because of the computational difficulty of multiple sequence alignment and the availability of numerous tools, it is critical to be able to assess the reliability of multiple alignments. We present a tool called StatSigMA to assess whether multiple alignments of nucleotide or amino acid sequences are contaminated with one or more unrelated sequences. There are numerous applications for which StatSigMA can be used. Two such applications are to distinguish homologous sequences from nonhomologous ones and to compare alignments produced by various multiple alignment tools. We present examples of both types of applications.
Keywords :
mathematical analysis; molecular biophysics; StatSigMA tool; amino acid sequence; homologous sequences; multiple sequence alignments; nonhomologous sequences; nucleotide sequence; Biology and genetics; Karlin-Altschul statistics; Life and Medical Sciences; Multiple sequence alignment; alignment accuracy; biology and genetics; computer applications.; discordance; life and medical sciences; Algorithms; Animals; Computational Biology; Computers; Databases, Genetic; Databases, Protein; Genomics; Humans; Models, Statistical; Reproducibility of Results; Sequence Alignment; Sequence Analysis; Sequence Analysis, DNA; Sequence Analysis, Protein; Software;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2007.70271