Title :
Dynamic Load Balancing for Unstructured Meshes on Space-Filling Curves
Author :
Harlacher, Daniel F. ; Klimach, Harald ; Roller, Sabine ; Siebert, Christian ; Wolf, Felix
Author_Institution :
Appl. Supercomput. in Eng., RWTH Aachen Univ., Aachen, Germany
Abstract :
Load imbalance is an important impediment on the path towards higher degrees of parallelism - especially for engineering codes with their highly unstructured problem domains. In particular, when load conditions change dynamically, efficient mesh partitioning becomes an indispensable ingredient of scalable design. However, popular graph-based methods such as those used by ParMetis require global knowledge, which effectively limits the problem size on distributed-memory machines. On such architectures, space-filling curves (SFCs) offer a memory-efficient alternative and many sophisticated schemes have already been proposed. In this paper, we present a simple strategy based on SFCs that is custom-tailored to the needs of static meshes with dynamically changing computational load. Exploiting the properties of this class of problems, it is not only easy to implement but also reduces memory requirements substantially. Moreover, exclusively relying on MPI collective operations, our load-balancing scheme also offers portable performance across a broad range of HPC systems. Experimental evaluation shows excellent scaling behavior for up to 16,384 cores on a Nehalem-Infiniband system and up to 294,912 processes on a Blue Gene/P system.
Keywords :
distributed memory systems; message passing; parallel machines; resource allocation; storage management; Blue Gene/P system; HPC system; MPI collective operation; Nehalem-Infiniband system; ParMetis; distributed-memory machine; dynamic load balancing; dynamically changing computational load; engineering codes; graph-based method; load imbalance; memory requirements; memory-efficient alternative; mesh partitioning; parallelism; portable performance; scalable design; space-filling curve; static mesh; unstructured mesh; Computational modeling; Heuristic algorithms; Load management; Load modeling; Memory management; Partitioning algorithms; Scalability; load balancing; partitioning; scalability; space-filling curve;
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0974-5
DOI :
10.1109/IPDPSW.2012.207