• DocumentCode
    495668
  • Title

    Improving Kalign via Reconstruction of Phylogenetic Tree and Iteration

  • Author

    Yang, Fan ; Zhu, Qingxin ; Zhao, Mingyuan

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
  • Volume
    1
  • fYear
    2009
  • fDate
    March 31 2009-April 2 2009
  • Firstpage
    625
  • Lastpage
    629
  • Abstract
    The multiple sequence alignment of DNA or protein sequences is one of the fundamental research topics in bioinformatics. Kalign is an widely used multiple sequence alignment method employing the Wu-Manber approximate string matching algorithm, which improves both the accuracy and speed of multiple sequence alignment, and it is especially well suited for the task of aligning large numbers of sequences or divergent sequences. However, the alignment quality is poor because of the inaccurate estimate of the distances between sequences. In this paper, a novel similarity measure based on matching protein subsequences is presented. Then an iterative algorithm, which combines re-estimation of distance and reconstruction of phylogenetic tree, is introduced to refine the alignment created by Kalign. As the result of experiment, we use the BAliBASE 3.0 alignment benchmark set for the assessment of our method. The result shows that our algorithm achieves more accurate alignment than Kalign does.
  • Keywords
    DNA; bioinformatics; genetics; iterative methods; molecular biophysics; proteins; BAliBASE 3.0 alignment; DNA sequence; Kalign; Wu-Manber approximate string matching algorithm; bioinformatics; iterative algorithm; multiple sequence alignment; phylogenetic tree reconstruction; protein sequence; Assembly; Benchmark testing; Bioinformatics; Computer science; DNA; Iterative algorithms; Libraries; Phylogeny; Proteins; Sequences; Kalign; iteration; multiple sequence alignment; similarity measure;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Engineering, 2009 WRI World Congress on
  • Conference_Location
    Los Angeles, CA
  • Print_ISBN
    978-0-7695-3507-4
  • Type

    conf

  • DOI
    10.1109/CSIE.2009.291
  • Filename
    5171247