• DocumentCode
    633050
  • Title

    Protein database search optimization based on CUDA and MPI

  • Author

    Pavlovic, D. ; Vaser, Robert ; Korpar, Matija ; Sikic, Mile

  • Author_Institution
    Fac. of Electr. Eng. & Comput., Univ. of Zagreb, Zagreb, Croatia
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    1278
  • Lastpage
    1280
  • Abstract
    Protein database search is an important method in computational biology. An average database contains a large number of sequences, which makes such searches time and resource consuming. With the rapid growth of these databases in recent years, there came a need to speed up the search and, consequently, any alignments performed on such databases. This paper presents an acceleration of the database search tool sw#DB, which is based on a CUDA implementation of the Smith-Waterman algorithm. The speedup is achieved by reducing the effective database size. The whole database is divided into seeds of a fixed length. The positions of these seeds and the corresponding sequence indexes from the database are then stored in a hash container. This allows a constant-time lookup of all the positions of a seed in every sequence of the database. Potential alignment candidate sequences for a query are filtered using this method, and only those that contain at least one seed from the query are forwarded to sw#DB. This reduces the number of alignments performed. Overall, it brings a speedup of around three times compared to the basic sw#DB tool, which is based solely on the Smith-Waterman algorithm, with almost no loss of accuracy. The implementation is written in the CUDA and C programming languages. For large queries, an MPI implementation with multiple CUDA cards is used.
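    The seed filter described in the abstract can be illustrated with a minimal C sketch. This is not the authors' code: the seed length of 3, the restriction to uppercase letters A-Z, the use of overlapping seeds taken at every position, the direct-addressed table standing in for the hash container, and all names (SeedIndex, seed_index_build, seed_index_filter, seed_index_free) are assumptions made for illustration only.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

#define SEED_LEN 3                  /* assumed seed length, not from the paper */
#define TABLE_SIZE (26 * 26 * 26)   /* one bucket per possible seed over A-Z */

/* One occurrence of a seed: which database sequence, and where in it. */
typedef struct Hit {
    int seq_index;
    int position;
    struct Hit *next;
} Hit;

/* Hypothetical index: bucket[key] lists every occurrence of that seed. */
typedef struct {
    Hit *buckets[TABLE_SIZE];
} SeedIndex;

/* Encode SEED_LEN letters as a base-26 number; -1 if a char is not A-Z. */
static int seed_key(const char *s) {
    int key = 0;
    for (int i = 0; i < SEED_LEN; i++) {
        if (s[i] < 'A' || s[i] > 'Z') return -1;
        key = key * 26 + (s[i] - 'A');
    }
    return key;
}

/* Index every overlapping seed of every database sequence. */
SeedIndex *seed_index_build(const char *const *seqs, int n_seqs) {
    SeedIndex *idx = malloc(sizeof *idx);
    assert(idx != NULL);
    for (int k = 0; k < TABLE_SIZE; k++) idx->buckets[k] = NULL;
    for (int s = 0; s < n_seqs; s++) {
        int len = (int)strlen(seqs[s]);
        for (int p = 0; p + SEED_LEN <= len; p++) {
            int key = seed_key(seqs[s] + p);
            if (key < 0) continue;            /* skip seeds with other symbols */
            Hit *h = malloc(sizeof *h);
            assert(h != NULL);
            h->seq_index = s;
            h->position = p;
            h->next = idx->buckets[key];
            idx->buckets[key] = h;
        }
    }
    return idx;
}

/* Set candidates[i] = 1 for each sequence that shares at least one seed
 * with the query; returns the number of candidate sequences. */
int seed_index_filter(const SeedIndex *idx, const char *query,
                      unsigned char *candidates, int n_seqs) {
    int count = 0;
    int len = (int)strlen(query);
    memset(candidates, 0, (size_t)n_seqs);
    for (int p = 0; p + SEED_LEN <= len; p++) {
        int key = seed_key(query + p);
        if (key < 0) continue;
        for (const Hit *h = idx->buckets[key]; h != NULL; h = h->next) {
            if (!candidates[h->seq_index]) {
                candidates[h->seq_index] = 1;
                count++;
            }
        }
    }
    return count;
}

/* Release all hit lists and the index itself. */
void seed_index_free(SeedIndex *idx) {
    for (int k = 0; k < TABLE_SIZE; k++) {
        Hit *h = idx->buckets[k];
        while (h != NULL) {
            Hit *next = h->next;
            free(h);
            h = next;
        }
    }
    free(idx);
}
```

    In this sketch, only the sequences flagged in candidates would be passed on to the Smith-Waterman aligner; the stored positions are kept in the index but not used by the filter itself.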
  • Keywords
    C language; application program interfaces; biology computing; information filtering; message passing; parallel algorithms; parallel architectures; proteins; query processing; CUDA implementation; MPI; computational biology; database search tool; protein database search optimization; sw#DB; Accuracy; Graphics processing units; Indexes; Optimization; Proteins; Software algorithms
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information & Communication Technology Electronics & Microelectronics (MIPRO), 2013 36th International Convention on
  • Conference_Location
    Opatija
  • Print_ISBN
    978-953-233-076-2
  • Type

    conf

  • Filename
    6596454