DocumentCode
1962145
Title
A fundamental failure model for fault-tolerant protocols
Author
Echtle, Klaus ; Masum, Asif
Author_Institution
FB6, Essen Univ., Germany
fYear
2000
fDate
2000
Firstpage
69
Lastpage
78
Abstract
The application area of distributed systems determines the extent to which protocols must provide fault detection and/or fault tolerance. Highest dependability can not be obtained without the cost of a substantial overhead. In order to reduce the message number and the time consumption, protocols should be tailored best to application requirements and system properties. This paper presents a novel failure classification as an instrument to limit fault detection and tolerance features to a reasonable failure set. Evaluation of protocols shows that just exclusion of “exotic” failures, which are most unlikely to occur enable a drastic increase in efficiency. Unlike other approaches, our failure classification is based on a completely functional model and on the definition of so-called failure capabilities. This overcomes the limitations of strictly hierarchic and time/value-based models. The new approach provides a framework to precisely specify common failure assumptions as well as very specialized scenarios-in particular so-called non-cooperative Byzantine failures
Keywords
distributed processing; fault tolerant computing; protocols; Byzantine failures; common failure assumptions; completely functional model; distributed systems; fault detection; fault tolerance; fault-tolerant protocols; fundamental failure model; system properties; time consumption; Costs; Design methodology; Fault detection; Fault tolerance; Fault tolerant systems; Instruments; Protocols; Tail; Technological innovation;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Performance and Dependability Symposium, 2000. IPDS 2000. Proceedings. IEEE International
Conference_Location
Chicago, IL
ISSN
1087-2191
Print_ISBN
0-7695-0553-8
Type
conf
DOI
10.1109/IPDS.2000.839465
Filename
839465
Link To Document