DocumentCode :
1244058
Title :
Message-optimal protocols for fault-tolerant broadcasts/multicasts in distributed systems with crash failures
Author :
Tzeng, Hong-Yi ; Siu, Kai-Yeung
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., Irvine, CA, USA
Volume :
44
Issue :
2
fYear :
1995
fDate :
2/1/1995 12:00:00 AM
Firstpage :
346
Lastpage :
352
Abstract :
An essential feature of any fault-tolerant distributed system design is a mechanism by which a process can reliably broadcast information to other processes in the presence of failures. The paper studies the message complexity of fault-tolerant broadcast protocols in weakly synchronous and totally asynchronous distributed systems with point-to-point communication links, where failures are caused by the processes and the communication links are completely reliable. We focus on the number of messages that any fault-tolerant protocol requires in failure-free executions. Our motivation is that one should incur the cost of handling failures only when they actually occur. We present protocols that, in an n-process system subject to at most t crash failures, where 1 ≤ t < n-1, guarantee the delivery of a message from any process to the other nonfaulty processes. In the absence of crash failures, our protocols require (n+t-1) messages in the weakly synchronous model and (t+1)(n-1-(t/2)) messages in the totally asynchronous model. Moreover, we show that in both cases our protocols are optimal with respect to message complexity. The new insights provided by our lower-bound proofs also yield graph-theoretic characterizations of all message-optimal reliable broadcast protocols in failure-free executions. Both the upper- and lower-bound results on broadcast protocols can be generalized to multicast protocols, where a process only needs to deliver a message to a subset of the processes in the system.
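To make the two failure-free message counts quoted in the abstract concrete, the following minimal sketch evaluates the closed forms for given n and t. It only computes the formulas as stated; it does not implement the protocols themselves, which the abstract does not describe. The function names are hypothetical and chosen for illustration.

```python
def _check(n: int, t: int) -> None:
    # The abstract states the bounds for 1 <= t < n - 1.
    if not (1 <= t < n - 1):
        raise ValueError("requires 1 <= t < n - 1")


def weakly_synchronous_messages(n: int, t: int) -> int:
    """Failure-free message count in the weakly synchronous model: n + t - 1."""
    _check(n, t)
    return n + t - 1


def totally_asynchronous_messages(n: int, t: int) -> int:
    """Failure-free message count in the totally asynchronous model.

    (t + 1)(n - 1 - t/2) is expanded to (t + 1)(n - 1) - t(t + 1)/2 so the
    arithmetic stays in integers; t(t + 1) is always even.
    """
    _check(n, t)
    return (t + 1) * (n - 1) - (t * (t + 1)) // 2


if __name__ == "__main__":
    for n, t in [(5, 2), (10, 3)]:
        print(n, t, weakly_synchronous_messages(n, t),
              totally_asynchronous_messages(n, t))
```

For example, with n = 5 and t = 2 the formulas give 6 messages in the weakly synchronous model and 9 in the totally asynchronous model.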
Keywords :
broadcasting; communication complexity; fault tolerant computing; graph theory; protocols; system recovery; crash failures; distributed systems; failure free executions; fault tolerant broadcast protocols; fault tolerant design; fault-tolerant broadcasts/multicasts; message complexity; message optimal reliable broadcast protocols; message-optimal protocols; multicast protocols; point to point communication links; system failures; Broadcasting; Computer crashes; Costs; Distributed computing; Fault tolerance; Fault tolerant systems; Multicast protocols
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
IEEE
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/12.364545
Filename :
364545