• DocumentCode
    2484385
  • Title

    Improving MPI-HMMER´s scalability with parallel I/O

  • Author

    Walters, John Paul ; Darole, Rohan ; Chaudhary, Vipin

  • Author_Institution
    Dept. of Comput. Sci. & Eng., State Univ. of New York at Buffalo, Buffalo, NY, USA
  • fYear
    2009
  • fDate
    23-29 May 2009
  • Firstpage
    1
  • Lastpage
    11
  • Abstract
    We present PIO-HMMER, an enhanced version of MPI-HMMER. PIO-HMMER improves on MPI-HMMER´s scalability through the use of parallel I/O and a parallel file system. In addition, we describe several enhancements, including a new load balancing scheme, enhanced post-processing, improved double- buffering support, and asynchronous I/O for returning scores to the master node. Our enhancements to the core HMMER search tools, hmmsearch and hmmpfam, allow for scalability up to 256 nodes where MPI-HMMER previously did not scale beyond 64 nodes. We show that our performance enhancements allow hmmsearch to achieve between 48x and 221x speedup using 256 nodes, depending on the size of the input HMM and the database. Further, we show that by integrating database caching with PIO-HMMER´s hmmpfam tool we can achieve up to 328x performance using only 256 nodes.
  • Keywords
    cache storage; file organisation; hidden Markov models; message passing; parallel processing; MPI-HMMER; PIO-HMMER; database caching; parallel I/O; parallel file system; scalability; Acceleration; Computer science; Concurrent computing; Databases; File systems; Hidden Markov models; Load management; Message passing; Moore´s Law; Scalability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
  • Conference_Location
    Rome
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4244-3751-1
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2009.5161074
  • Filename
    5161074