• DocumentCode
    2790726
  • Title

    Improving MPI Independent Write Performance Using A Two-Stage Write-Behind Buffering Method

  • Author

    Liao, Wei-keng ; Ching, Avery ; Coloma, Kenin ; Choudhary, Alok ; Kandemir, Mahmut

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Northwestern Univ., Evanston, IL
  • fYear
    2007
  • fDate
    26-30 March 2007
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Many large-scale production applications have very long execution times and require periodic data checkpoints to save the state of the computation for program restart and/or for tracing application progress. These write-only operations often dominate the overall application runtime, which makes them a good optimization target. Approaches for write-behind data buffering at the MPI I/O level have been proposed, but challenges remain in addressing system-level I/O issues. We propose a two-stage write-behind buffering scheme for handling checkpoint operations. The first stage of buffering accumulates write data for better network utilization, and the second stage of buffering enables the alignment of write requests to file stripe boundaries. Aligned I/O requests avoid file lock contention that can seriously degrade I/O performance. We present our performance evaluation using BTIO benchmarks on both GPFS and Lustre file systems. With the two-stage buffering, the performance of BTIO through MPI independent I/O is significantly improved and even surpasses that of collective I/O.
  • Keywords
    application program interfaces; checkpointing; message passing; BTIO benchmark; GPFS; Lustre file system; MPI independent write performance; data checkpoint; write-behind buffering method; Application software; Checkpointing; Computer science; Data engineering; Degradation; File systems; Large-scale systems; Message passing; Production; Runtime;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
  • Conference_Location
    Long Beach, CA
  • Print_ISBN
    1-4244-0910-1
  • Electronic_ISBN
    1-4244-0910-1
  • Type

    conf

  • DOI
    10.1109/IPDPS.2007.370485
  • Filename
    4228213
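
  • Illustration
    The abstract states that the second buffering stage aligns write requests to file stripe boundaries. The following is a minimal sketch of that idea only, not the paper's implementation: the function name split_at_stripes and the stripe size used in the example are hypothetical, chosen purely for illustration. It cuts a byte range into pieces that never cross a stripe boundary, so each piece falls within a single stripe.

```python
def split_at_stripes(offset, length, stripe_size):
    """Split the byte range [offset, offset + length) at stripe boundaries.

    Returns a list of (offset, length) pieces, each contained in one stripe.
    Hypothetical helper for illustration; not taken from the paper.
    """
    if offset < 0 or length < 0 or stripe_size <= 0:
        raise ValueError("offset and length must be >= 0, stripe_size > 0")

    pieces = []
    end = offset + length
    while offset < end:
        # First byte past the stripe that contains `offset`.
        stripe_end = (offset // stripe_size + 1) * stripe_size
        # Stop at the stripe boundary or at the end of the request.
        piece_end = min(stripe_end, end)
        pieces.append((offset, piece_end - offset))
        offset = piece_end
    return pieces


if __name__ == "__main__":
    # Assumed stripe size of 128 bytes, purely for demonstration.
    for piece in split_at_stripes(100, 200, 128):
        print(piece)
```

    A request that starts mid-stripe yields a short leading piece, zero or more full-stripe pieces, and a short trailing piece. Pieces that fall in the same stripe could then be gathered and written together.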