Title :
Parallelization of a dynamic unstructured algorithm using three leading programming paradigms
Author :
Oliker, Leonid ; Biswas, Rupak
Author_Institution :
NERSC, Lawrence Berkeley Nat. Lab., CA, USA
fDate :
9/1/2000 12:00:00 AM
Abstract :
The success of parallel computing in solving real-life computationally intensive problems relies on their efficient mapping and execution on large-scale multiprocessor architectures. Many important applications are both unstructured and dynamic in nature, making their efficient parallel implementation a daunting task. This paper presents the parallelization of a dynamic unstructured mesh adaptation algorithm using three popular programming paradigms on three leading supercomputers. We examine an MPI message-passing implementation on the Cray T3E and the SGI Origin2000, a shared-memory implementation using the cache coherent nonuniform memory access (CC-NUMA) feature of the Origin2000, and a multithreaded version on the newly released Tera Multithreaded Architecture (MTA). We compare several critical factors of this parallel code development, including runtime, scalability, programmability, portability, and memory overhead. Our overall results demonstrate that multithreaded systems offer tremendous potential for quickly and efficiently solving some of the most challenging real-life problems on parallel computers.
Keywords :
parallel architectures; parallel programming; Cray T3E; MPI message-passing implementation; Origin2000; SGI Origin2000; Tera Multithreaded Architecture; cache coherent nonuniform memory access; memory overhead; multiprocessor architectures; parallel code development; parallel computers; parallel computing; parallelization; portability; programmability; scalability; shared-memory implementation; Computer architecture; Concurrent computing; Dynamic programming; Heuristic algorithms; Large-scale systems; Parallel processing; Parallel programming; Runtime; Scalability; Supercomputers;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on