DocumentCode
320075
Title
A fault-tolerant distributed algorithm for termination detection using roughly synchronized clocks
Author
Dash, P.E. ; Hansdah, R.C.
Author_Institution
Software Dev. Center, Motorola India Electron. Ltd., Bangalore, India
fYear
1997
fDate
10-13 Dec 1997
Firstpage
736
Lastpage
743
Abstract
A fully symmetric and distributed solution to the termination detection problem is presented along with a proof of its correctness. Each of the processors in an asynchronous (or synchronous) network has a physical clock which is roughly synchronized with all other processors´ physical clocks. The control messages that are timestamped using roughly synchronized clocks are circulated around an unidirectional (logical or physical) ring of all processors to detect termination of the underlying computation. The algorithm detects termination along with the correctness of the result of the underlying computation even in the situations when the processors fail and recover. The algorithm also works correctly even after the failure of clock synchronization. This algorithm involves considerably less number of messages to detect global termination in any network (synchronous or asynchronous) compared to other similar algorithms in the literature
Keywords
distributed algorithms; fault tolerant computing; program verification; synchronisation; control messages; correctness proof; fault-tolerant distributed algorithm; global termination; physical clock; roughly synchronized clocks; termination detection; Automation; Clocks; Concurrent computing; Distributed algorithms; Distributed computing; Fault detection; Fault tolerance; Physics computing; Programming; Synchronization;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Systems, 1997. Proceedings., 1997 International Conference on
Conference_Location
Seoul
Print_ISBN
0-8186-8227-2
Type
conf
DOI
10.1109/ICPADS.1997.652624
Filename
652624
Link To Document