Title :
Process/thread migration and checkpointing in heterogeneous distributed systems
Author :
Jiang, Hai ; Chaudhary, Vipin
Author_Institution :
Inst. for Sci. Comput., Wayne State Univ., Detroit, MI, USA
Abstract :
Process/thread migration and checkpointing are indispensable for resource sharing, cycle stealing, and other modes of interaction. To provide a flexible, transparent, and portable solution in heterogeneous environments, we have developed a multi-grained migration/checkpointing package, MigThread, which can migrate/checkpoint multiple threads to different machines or file systems simultaneously, and also perform single coarse-grained process migration/checkpointing. For scalability and portability, computation states are extracted out of their original places and abstracted to the language level. With the user-level stack/heap management, MigThread does not rely on any thread libraries and operating systems. For heterogeneity, a novel data conversion scheme is proposed to analyze data types automatically and convert data only on the receiver side. For safety, MigThread detects and overcomes "unsafe" factors to qualify virtually all C programs for migration/checkpointing. Some performance measurements are given to illustrate its effectiveness.
Keywords :
checkpointing; distributed processing; MigThread; cycle stealing; heterogeneous distributed systems; multigrained checkpointing; multigrained migration; resource sharing; single coarse-grained process checkpointing; single coarse-grained process migration; thread checkpointing; thread migration; Checkpointing; Data conversion; Data mining; File systems; Libraries; Operating systems; Packaging machines; Resource management; Scalability; Yarn;
Conference_Titel :
System Sciences, 2004. Proceedings of the 37th Annual Hawaii International Conference on
Print_ISBN :
0-7695-2056-1
DOI :
10.1109/HICSS.2004.1265665