DocumentCode :
3148582
Title :
Optimal object state transfer - recovery policies for fault tolerant distributed systems
Author :
Katsaros, Panagiotis ; Lazos, Constantine
Author_Institution :
Dept. of Informatics, Aristotle Univ. of Thessaloniki, Greece
fYear :
2004
fDate :
28 June-1 July 2004
Firstpage :
762
Lastpage :
771
Abstract :
Recent developments in the field of object-based fault tolerance and the advent of the first OMG FT-CORBA compliant middleware raise new requirements for the design process of distributed fault-tolerant systems. In this work, we introduce a simulation-based design approach based on the optimum effectiveness of the compared fault tolerance schemes. Each scheme is defined as a set of fault tolerance properties for the objects that compose the system. Its optimum effectiveness is determined by the tightest effective checkpoint intervals, for the passively replicated objects. Our approach allows mixing miscellaneous fault tolerance policies, as opposed to the published analytic models, which are best suited in the evaluation of single-server process replication schemes. Special emphasis has been given to the accuracy of the generated estimates using an appropriate simulation output analysis procedure. We provide showcase results and compare two characteristic warm passive replication schemes: one with periodic and another one with load-dependent object state checkpoints. Finally, a trade-off analysis is applied, for determining appropriate checkpoint properties, in respect to a specified design goal.
Keywords :
checkpointing; distributed object management; middleware; software fault tolerance; OMG FT-CORBA; checkpointing; fault tolerant distributed systems; middleware; object replication; object state checkpoints; object-based fault tolerance; optimal object state transfer; single-server process replication; trade-off analysis; Analytical models; Application software; Delay; Fault tolerance; Fault tolerant systems; Informatics; Middleware; Process design; Robustness; Software systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Dependable Systems and Networks, 2004 International Conference on
Print_ISBN :
0-7695-2052-9
Type :
conf
DOI :
10.1109/DSN.2004.1311947
Filename :
1311947
Link To Document :
بازگشت