  • DocumentCode
    3520069
  • Title
    Conservative, Non-conservative and Average Pairwise Statistical Significance of Local Sequence Alignment
  • Author
    Agrawal, Ankit; Huang, Xiaoqiu
  • Author_Institution
    Dept. of Comput. Sci., Iowa State Univ., Ames, IA
  • fYear
    2008
  • fDate
    3-5 Nov. 2008
  • Firstpage
    433
  • Lastpage
    436
  • Abstract
    Estimating the statistical significance of a pairwise alignment is an important problem in sequence comparison. Recently, it was shown that pairwise statistical significance does better in practice than database statistical significance in terms of retrieval accuracy of homologs. In this paper, we introduce the concepts of conservative, non-conservative, and average pairwise statistical significance, which can be easily derived from the original pairwise statistical significance estimates and which use more information specific to the sequence pair under consideration by means of multiple shuffle spaces. Experimental results for homology detection reveal that the proposed measures give retrieval accuracy that is at least comparable to, or significantly better than, that of the original pairwise statistical significance and of the database statistical significance reported by BLAST, PSI-BLAST, and SSEARCH. The proposed measures are further shown to be extremely useful when using sequence-specific substitution matrices.
  • Keywords
    DNA; biology computing; molecular biophysics; proteins; statistical analysis; BLAST; PSI-BLAST; SSEARCH; database statistical significance; homology detection; local sequence alignment; multiple shuffle spaces; pairwise alignment; pairwise statistical significance; retrieval accuracy; sequence-specific substitution matrix; Bioinformatics; Biomedical measurements; Computer science; Databases; Information retrieval; Length measurement; Maximum likelihood estimation; Sequences; State estimation; USA Councils; Database statistical significance; Homologs; Pairwise statistical significance; Sequence Alignment
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Bioinformatics and Biomedicine, 2008. BIBM '08. IEEE International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    978-0-7695-3452-7
  • Type
    conf
  • DOI
    10.1109/BIBM.2008.19
  • Filename
    4684934