Title :
CoCheck: checkpointing and process migration for MPI
Author_Institution :
Inst. fur Inf., Tech. Univ. Munchen, Germany
Abstract :
Checkpointing of parallel applications can be used as the core technology to provide process migration. Both checkpointing and migration, are an important issue for parallel applications on networks of workstations. The CoCheck environment which we present in this paper introduces a new approach to provide checkpointing and migration for parallel applications. CoCheck sits on top of the message passing library and achieves consistency at a level above the message passing system. It uses an existing single process checkpointer which is available for a wide range of systems. Hence, CoCheck can be easily adapted to both, different message passing systems and new machines
Keywords :
local area networks; message passing; parallel machines; resource allocation; software libraries; CoCheck; LAN; MPI; checkpointing; consistency; local area networks; message passing library; parallel applications; process migration; single process checkpointer; workstation networks; Availability; Checkpointing; Concurrent computing; Distributed computing; Libraries; Message passing; Programming environments; Resumes; Workstations;
Conference_Titel :
Parallel Processing Symposium, 1996., Proceedings of IPPS '96, The 10th International
Conference_Location :
Honolulu, HI
Print_ISBN :
0-8186-7255-2
DOI :
10.1109/IPPS.1996.508106