• DocumentCode
    723709
  • Title

    Scheduling the I/O of HPC Applications Under Congestion

  • Author

    Gainaru, Ana ; Aupy, Guillaume ; Benoit, Anne ; Cappello, Franck ; Robert, Yves ; Snir, Marc

  • Author_Institution
    Univ. of Illinois at Urbana Champaign, Champaign, IL, USA
  • fYear
    2015
  • fDate
    25-29 May 2015
  • Firstpage
    1013
  • Lastpage
    1022
  • Abstract
    A significant percentage of the computing capacity of large-scale platforms is wasted because of interferences incurred by multiple applications that access a shared parallel file system concurrently. One solution to handling I/O bursts enlarge-scale HPC systems is to absorb them at an intermediate storage layer consisting of burst buffers. However, our analysis of the Argonne´s Mira system shows that burst buffers cannot prevent congestion at all times. Consequently, I/O performances dramatically degraded, showing in some cases a decrease in I/O throughput of 67%. In this paper, we analyze the effects of interference on application I/O bandwidth and propose several scheduling techniques to mitigate congestion. We show through extensive experiments that our global I/O scheduler is able to reduce the effects of congestion, even on systems where burst buffers are used, and can increase the overall system throughput up to 56%. We also show that it outperforms current Mira I/O schedulers.
  • Keywords
    buffer storage; parallel processing; scheduling; HPC applications; I/O bursts enlarge-scale HPC systems; I/O scheduling; Mira system; burst buffers; computing capacity; congestion mitigation; global I/O scheduler; intermediate storage layer; large-scale platforms; shared parallel file system; Bandwidth; Computational modeling; Interference; Optimization; Processor scheduling; Program processors; Throughput; HPC application performance; I/O congestion; I/O scheduler; burst buffers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE International
  • Conference_Location
    Hyderabad
  • ISSN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2015.116
  • Filename
    7161586