Title :
The Hector distributed run-time environment
Author :
Russ, Samuel H. ; Robinson, Jonathan ; Flachs, Brian K. ; Heckel, Bjorn
Author_Institution :
Eng. Res. Center, Mississippi State Univ., MS, USA
fDate :
11/1/1998 12:00:00 AM
Abstract :
Harnessing the computational capabilities of a network of workstations promises to off-load work from overloaded supercomputers onto largely idle resources overnight. Several capabilities are needed to do this, including support for an architecture-independent parallel programming environment, task migration, automatic resource allocation, and fault tolerance. The Hector distributed run-time environment is designed to present these capabilities transparently to programmers. MPI programs can be run under this environment on homogeneous clusters with no modifications to their source code needed. The design of Hector, its internal structure, and several benchmarks and tests are presented
Keywords :
fault tolerant computing; parallel programming; programming environments; resource allocation; Hector distributed run-time environment; MPI programs; architecture-independent parallel programming environment; automatic resource allocation; benchmarks; fault tolerance; network of workstations; task migration; Availability; Computer networks; Fault tolerance; Load management; Parallel programming; Programming profession; Resource management; Runtime environment; Supercomputers; Workstations;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on