• DocumentCode
    3322912
  • Title

    Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes

  • Author

    Rabenseifner, Rolf ; Hager, Georg ; Jost, Gabriele

  • Author_Institution
    High Performance Comput. Center Stuttgart (HLRS)
  • fYear
    2009
  • fDate
    18-20 Feb. 2009
  • Firstpage
    427
  • Lastpage
    436
  • Abstract
    Today most systems in high-performance computing (HPC) feature a hierarchical hardware design: Shared memory nodes with several multi-core CPUs are connected via a network infrastructure. Parallel programming must combine distributed memory parallelization on the node interconnect with shared memory parallelization inside each node. We describe potentials and challenges of the dominant programming models on hierarchically structured hardware: Pure MPI (message passing interface), pure OpenMP (with distributed shared memory extensions) and hybrid MPI+OpenMP in several flavors. We pinpoint cases where a hybrid programming model can indeed be the superior solution because of reduced communication needs and memory consumption, or improved load balance. Furthermore we show that machine topology has a significant impact on performance for all parallelization strategies and that topology awareness should be built into all applications in the future. Finally we give an outlook on possible standardization goals and extensions that could make hybrid programming easier to do with performance in mind.
  • Keywords
    application program interfaces; distributed memory systems; message passing; parallel programming; resource allocation; shared memory systems; MPI-OpenMP parallel programming; distributed memory parallelization; high-performance computing; load balance improvement; message passing interface; multicore SMP nodes; shared memory parallelization; Computer networks; Concurrent computing; Distributed computing; Hardware; High performance computing; Message passing; Parallel programming; Taxonomy; Topology; Yarn; Hybrid programming; MPI; OpenMP; SMP; multi-core;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-based Processing, 2009 17th Euromicro International Conference on
  • Conference_Location
    Weimar
  • ISSN
    1066-6192
  • Print_ISBN
    978-0-7695-3544-9
  • Type

    conf

  • DOI
    10.1109/PDP.2009.43
  • Filename
    4912964