• DocumentCode
    3136204
  • Title

    Divide-and-Conquer Algorithm for Clustalw-MPI

  • Author

    Rezaei, Siamak ; Monwar, Md Maruf

  • Author_Institution
    Comput. Sci., Univ. of Northern British Columbia, BC
  • fYear
    2006
  • fDate
    38838
  • Firstpage
    717
  • Lastpage
    720
  • Abstract
    Multiple sequence alignment continues to be an active field of research in computational biology and the most widely used tool for multiple sequence alignment is ClustalW, which achieves alignment via three steps: pair wise alignment, guide tree generation and progressive alignment. ClustalW-MPI is a parallel implementation of ClustalW. In this paper, a new approach, divide-and-conquer, is implemented which uses ClustalW-MPI for sequence alignment but it gets a better speed up performance than ClustalW-MPI. In this approach, the sequences are first cut down into smaller subsequences by divide-and-conquer technique to minimize the computational space. Then these subsequences are sent to different available processors using message passing interface technique. Those processors align the subsequences by executing ClustalW-MPI simultaneously. After aligning, the results are then sent to the main processor to be concatenated to produce the final alignment. But some quality of the alignment may be compromised in this approach for the introduction of gaps at the start or end of subsequences aligned. Therefore, some heuristic methods for fixing the cut points were suggested for future improvement, such as overlapping alignment and sliding window alignment
  • Keywords
    application program interfaces; biology computing; divide and conquer methods; genetics; message passing; sequences; trees (mathematics); ClustalW-MPI; computational biology; divide-and-conquer algorithm; guide tree generation; message passing interface technique; multiple sequence alignment; pair wise alignment; progressive alignment; Bioinformatics; Computational biology; Computer science; Concatenated codes; Dynamic programming; Genomics; Heuristic algorithms; Message passing; Proteins; Sequences; Divide-and-Conquer algorithm; message passing interface; multiple sequence alignment; multiple tree alignment;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical and Computer Engineering, 2006. CCECE '06. Canadian Conference on
  • Conference_Location
    Ottawa, Ont.
  • Print_ISBN
    1-4244-0038-4
  • Electronic_ISBN
    1-4244-0038-4
  • Type

    conf

  • DOI
    10.1109/CCECE.2006.277630
  • Filename
    4054650