DocumentCode :
3349651
Title :
Synergistic coordination between software and hardware fault tolerance techniques
Author :
Tai, Ann T. ; Tso, Kam S. ; Alkalai, Leon ; Chau, Savio N. ; Sanders, William H.
Author_Institution :
IA Tech. Inc., Los Angeles, CA, USA
fYear :
2001
fDate :
1-4 July 2001
Firstpage :
369
Lastpage :
378
Abstract :
Describes an approach for enabling the synergistic coordination between two fault-tolerance protocols to simultaneously tolerate software and hardware faults in a distributed computing environment. Specifically, our approach is based on a message-driven confidence-driven (MDCD) protocol that we have devised for tolerating software design faults, and a time-based (TB) checkpointing protocol that was developed by N. Neves and W.K. Fuchs (1996) for tolerating hardware faults. By carrying out algorithm modifications that are conducive to synergistic coordination between volatile-storage and stable-storage checkpoint establishments, we are able to circumvent the potential interference between the MDCD and TB protocols, and to allow them to effectively complement each other to extend a system's fault tolerance capability. Moreover, the protocol coordination approach preserves and enhances the features and advantages of the individual protocols that participate in the coordination, keeping the performance cost low.
Keywords :
fault tolerant computing; protocols; system recovery; algorithm modifications; distributed computing environment; fault-tolerance protocols; hardware fault-tolerance techniques; message-driven confidence-driven protocol; performance cost; protocol coordination approach; protocol interference; software design faults; software fault-tolerance techniques; stable storage; synergistic coordination; time-based checkpointing protocol; volatile storage; Application software; Checkpointing; Distributed computing; Fault tolerance; Fault tolerant systems; Hardware; Propulsion; Protocols; Redundancy; Software maintenance;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Dependable Systems and Networks, 2001. DSN 2001. International Conference on
Conference_Location :
Goteborg, Sweden
Print_ISBN :
0-7695-1101-5
Type :
conf
DOI :
10.1109/DSN.2001.941421
Filename :
941421