Title :
New hybrid fault models for asynchronous approximate agreement
Author :
Azadmanesh, M.H. ; Kieckhafer, R.M.
Author_Institution :
Centre for Manage. of Inf. Technol., Nebraska Univ., Omaha, NE, USA
fDate :
4/1/1996 12:00:00 AM
Abstract :
An important problem in fault-tolerant distributed systems is maintaining agreement between nonfaulty processes in the presence of undiagnosed faults. To achieve agreement, processes exchange their local “opinions” of a particular value, and then vote on the values received to arrive at a “consensus”. Approximate agreement defines a condition in which it is not necessary for consensus values to be identical. Rather, it is only necessary that they agree to within a predefined tolerance. Approximate agreement can be achieved through a sequence of convergent voting rounds, in which the range of values held by nonfaulty nodes is reduced in each round. Research has revealed simple expressions for the convergence rate and fault tolerance of a broad family of convergent voting algorithms called Mean-Subsequence-Reduced (MSR) algorithms. These results were derived under the Thambidurai and Park (1988) hybrid fault model comprised of asymmetric, symmetric, and benign faults. However, these results apply only to synchronous systems, in which there is a known finite bound on computation and communications times. We extend the previous results to asynchronous systems, in which no such bound exists. In addition, we introduce two new hybrid fault models which further differentiate between omissive faults and transmissive faults. The new fault models permit tighter bounds on the fault-tolerance of asynchronous systems to be derived
Keywords :
computational complexity; convergence; fault tolerant computing; multiprocessing systems; synchronisation; Mean-Subsequence-Reduced algorithms; asymmetric faults; asynchronous approximate agreement; asynchronous systems; benign faults; communications times; computation times; consensus; convergence rate; convergent voting rounds; fault tolerance; fault-tolerant distributed systems; hybrid fault models; multiprocessing systems; nonfaulty processes; omissive faults; symmetric faults; transmissive faults; undiagnosed faults; voting algorithms; Clocks; Computer Society; Convergence; Electronic switching systems; Fault tolerance; Fault tolerant systems; Information management; Synchronization; Technology management; Voting;
Journal_Title :
Computers, IEEE Transactions on