Title :
Portable checkpointing for heterogeneous architectures
Author :
Ramkumar, B. ; Strumpen, V.
Author_Institution :
Dept. of Electr. & Comput. Eng., Iowa Univ., Iowa City, IA, USA
Abstract :
Current approaches for checkpointing assume system homogeneity, where checkpointing and recovery are both performed on the same processor architecture and operating system configuration. Sometimes it is desirable or necessary to recover a failed computation on a different processor architecture. For such situations checkpointing and recovery must be portable. In this paper, we argue that source-to-source compilation is an appropriate concept for this purpose. We describe the compilation techniques that we developed for the design of the c2ftc prototype. The c2fte compiler enables machine-independent checkpoints by automatic generation of checkpointing and recovery code. Sequential C programs are compiled into fault tolerant C programs, whose checkpoints can be migrated across heterogeneous networks, and restarted on binary incompatible architectures. Experimental results on several systems provide evidence that the performance penalty of portable checkpointing is negligible for realistic checkpointing frequencies.
Keywords :
parallel architectures; program compilers; system recovery; binary incompatible architectures; c2ftc prototype; c2fte compiler; fault tolerant C programs; heterogeneous architectures; heterogeneous networks; machine-independent checkpoints; operating system configuration; performance penalty; portable checkpointing; processor architecture; recovery; sequential C programs; source-to-source compilation; system homogeneity; Checkpointing; Cities and towns; Computer architecture; Fault tolerance; Frequency; Hardware; Operating systems; Prototypes; Runtime; Space technology;
Conference_Titel :
Fault-Tolerant Computing, 1997. FTCS-27. Digest of Papers., Twenty-Seventh Annual International Symposium on
Conference_Location :
Seattle, WA, USA
Print_ISBN :
0-8186-7831-3
DOI :
10.1109/FTCS.1997.614078