• DocumentCode
    1016417
  • Title

    Assessing the Discordance of Multiple Sequence Alignments

  • Author

    Prakash, Amol ; Tompa, Martin

  • Author_Institution
    Thermo, Biomarker Res. Initiative in Mass Spectrometry (BRIMS) Center, Cambridge, MA, USA
  • Volume
    6
  • Issue
    4
  • fYear
    2009
  • Firstpage
    542
  • Lastpage
    551
  • Abstract
    Multiple sequence alignments have wide applicability in many areas of computational biology, including comparative genomics, functional annotation of proteins, gene finding, and modeling evolutionary processes. Because of the computational difficulty of multiple sequence alignment and the availability of numerous tools, it is critical to be able to assess the reliability of multiple alignments. We present a tool called StatSigMA to assess whether multiple alignments of nucleotide or amino acid sequences are contaminated with one or more unrelated sequences. There are numerous applications for which StatSigMA can be used. Two such applications are to distinguish homologous sequences from nonhomologous ones and to compare alignments produced by various multiple alignment tools. We present examples of both types of applications.
  • Keywords
    mathematical analysis; molecular biophysics; StatSigMA tool; amino acid sequence; homologous sequences; multiple sequence alignments; nonhomologous sequences; nucleotide sequence; Biology and genetics; Karlin-Altschul statistics; Life and Medical Sciences; Multiple sequence alignment; alignment accuracy; biology and genetics; computer applications.; discordance; life and medical sciences; Algorithms; Animals; Computational Biology; Computers; Databases, Genetic; Databases, Protein; Genomics; Humans; Models, Statistical; Reproducibility of Results; Sequence Alignment; Sequence Analysis; Sequence Analysis, DNA; Sequence Analysis, Protein; Software;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2007.70271
  • Filename
    4407682