DocumentCode :
2820972
Title :
Consistent state restoration in shared memory systems
Author :
Baldoni, Roberto ; Helary, J.-M. ; Mostefaoui, Achour ; Raynal, Michel
Author_Institution :
IRISA, Rennes, France
fYear :
1997
fDate :
19-21 Mar 1997
Firstpage :
330
Lastpage :
337
Abstract :
In many systems, backward recovery constitutes a classical technique to ensure fault-tolerance. It consists in restoring a computation in a consistent global state, saved in a global checkpoint, from which this computation can be resumed. A global checkpoint includes a set of local checkpoints, one from each process which correspond to local states dumped onto stable storage. In this paper we are interested in defining formally the domino effect for shared memory systems be the shared memory a physical one (as in multiprocessor systems) or a virtual one (as in distributed shared memory systems) and in designing a domino-free adaptive algorithm. These results lie on a necessary and sufficient condition which shows when a set of local checkpoints can belong to some consistent global checkpoint
Keywords :
shared memory systems; system recovery; backward recovery; consistent global state; domino-free adaptive algorithm; fault-tolerance; global checkpoint; shared memory systems; state restoration; Adaptive algorithm; Algorithm design and analysis; Checkpointing; Context modeling; Distributed computing; Fault tolerant systems; Kernel; Multiprocessing systems; Parallel machines; Protocols;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Parallel and Distributed Computing, 1997. Proceedings
Conference_Location :
Shanghai
Print_ISBN :
0-8186-7876-3
Type :
conf
DOI :
10.1109/APDC.1997.574051
Filename :
574051
Link To Document :
بازگشت