Title :
Computing global functions in asynchronous distributed systems prone to process crashes
Author :
Helary, J.M. ; Hurfin, M. ; Mostefaoui, A. ; Raynal, M. ; Tronel, F.
Author_Institution :
IRISA, Rennes, France
Abstract :
Global data is a vector with one entry per process. Each entry must be filled with an appropriate value provided by the corresponding process. Several distributed computing problems amount to computing a function on global data. This paper proposes a protocol to solve such problems in the context of asynchronous distributed systems where processes may fail by crashing. The main problem that has to be solved lies in computing the global data and in providing each non-crashed process with a copy of it, despite the possible crash of some processes. To be consistent, the global data must contain (at least) all the values provided by the processes that do not crash. This defines the global data computation (GDC) problem. To solve this problem, processes execute a sequence of asynchronous rounds during which they construct (in a decentralized way) the value of the global data, and eventually each process gets a copy of it. To cope with process crashes, the protocol uses a perfect failure detector. The proposed protocol has been designed to be time-efficient and allows early decisions. Let t be the maximum number of processes that may crash (t < n, where n is the total number of processes) and f be the actual number of process crashes (f ⩽ t). In the worst case, the protocol terminates in min(2f+2, t+1) rounds. Moreover, the protocol does not require processes to exchange information on their perception of crashes. The message size depends only on the size of the global data.
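The round bound and the consistency condition stated in the abstract can be illustrated with a minimal Python sketch. It is not taken from the paper and does not implement the protocol itself: the function names `worst_case_rounds` and `is_consistent`, and the use of `None` to mark an unfilled entry, are assumptions made for illustration only.

```python
def worst_case_rounds(f: int, t: int, n: int) -> int:
    """Worst-case round count min(2f+2, t+1) quoted in the abstract.

    f: actual number of crashes, t: maximum tolerated crashes,
    n: total number of processes (requires 0 <= f <= t < n).
    """
    if not (0 <= f <= t < n):
        raise ValueError("expected 0 <= f <= t < n")
    return min(2 * f + 2, t + 1)


def is_consistent(global_data, crashed):
    """Check the GDC consistency condition on one copy of the global data.

    global_data: list with one entry per process; None marks an unfilled
                 entry (an assumption of this sketch).
    crashed:     set of indices of processes that crashed.
    Every process that did not crash must have its entry filled; entries of
    crashed processes may or may not be filled.
    """
    return all(
        value is not None
        for index, value in enumerate(global_data)
        if index not in crashed
    )


# With n = 10 and t = 9, few crashes allow an early decision:
print(worst_case_rounds(0, 9, 10))  # 2  (min(2, 10))
print(worst_case_rounds(3, 9, 10))  # 8  (min(8, 10))
print(worst_case_rounds(5, 9, 10))  # 10 (min(12, 10))
```

When f is small relative to t, the 2f+2 term dominates the minimum, which is what "early decision" refers to; once 2f+2 exceeds t+1, the bound falls back to t+1 rounds.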
Keywords :
distributed processing; protocols; software fault tolerance; system recovery; asynchronous distributed systems; global data; global data computation; global function computing; message size; perfect failure detector; process crashes; protocol; time-efficient; vector; Ash; Computer crashes; Detectors; Distributed computing; Electrical capacitance tomography; Protocols; Voting;
Conference_Title :
Distributed Computing Systems, 2000. Proceedings. 20th International Conference on
Conference_Location :
Taipei
Print_ISBN :
0-7695-0601-1
DOI :
10.1109/ICDCS.2000.840973