Title :
Fault-tolerant clock synchronization for distributed systems with high message delay variation
Author :
De Azevedo, Marcelo Moraes ; Blough, Douglas M.
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., Irvine, CA, USA
Abstract :
Fault-tolerant clock synchronization is an important requirement in many distributed systems, especially in time-critical and safety-critical applications. Frequently, interactive convergence algorithms are used for fault-tolerant clock synchronization, providing advantages such as fully distributed operation, low message exchange overhead and simplicity of implementation. This paper presents the measured performance of three interactive convergence clock synchronization algorithms. Our experiments were conducted in a distributed UNIX environment featuring high message delay variation, which poses severe constraints on the clock synchronization tightness that may be achieved. The algorithms that were tested are: FTMA (fault-tolerant midpoint algorithm), AEFTMA (adaptive exponential averaging fault-tolerant midpoint algorithm), and SWA (sliding window algorithm). Our experimental results indicate that SWA outperforms the other algorithms in this environment, being able to achieve tighter synchronization under different simulated fault conditions. The superiority of SWA can be attributed to its high degree of fault tolerance, combined with its ability to treat messages with much longer than expected delays as faults
Keywords :
delays; distributed processing; fault tolerant computing; safety-critical software; synchronisation; adaptive exponential averaging fault-tolerant midpoint algorithm; distributed UNIX environment; distributed systems; fault tolerance; fault-tolerant clock synchronization; fault-tolerant midpoint algorithm; fully distributed operation; high message delay variation; interactive convergence algorithms; interactive convergence clock synchronization algorithms; low message exchange overhead; safety-critical applications; simulated fault conditions; sliding window algorithm; time-critical application; Application software; Clocks; Computational modeling; Convergence; Delay; Fault tolerance; Fault tolerant systems; Synchronization; Testing; Time factors;
Conference_Titel :
Fault-Tolerant Parallel and Distributed Systems, 1994., Proceedings of IEEE Workshop on
Conference_Location :
College Station, TX
Print_ISBN :
0-8186-6807-5
DOI :
10.1109/FTPDS.1994.494499