Title :
Fault-tolerant scheduling algorithm for distributed real-time systems
Author :
Tsuchiya, Tatsuhiro ; Kakuda, Yoshiaki ; Kikuno, Tohru
Author_Institution :
Dept. of Inf. & Comput. Sci., Osaka Univ., Japan
Abstract :
In a distributed real-time system, tolerance to faults on processing nodes in the system is achieved by means of redundant nodes and a fault-tolerant scheduling algorithm. Since redundant nodes increase total failure rate of the system, the number of such redundant nodes should be small. This paper proposes a procedure for fault-tolerant scheduling, which realizes fault-tolerance via a small number of redundant nodes. The procedure is based on such a technique that allows multiple copies of a task to be executed concurrently. It achieves efficient utilization of nodes by forcing copies being executed to terminate immediately after having obtained the first result of these copies. A fundamental scheduling algorithm, into which the procedure is incorporated, is defined and its simulation results are shown
Keywords :
distributed processing; fault tolerant computing; processor scheduling; real-time systems; distributed real-time systems; fault-tolerant scheduling algorithm; multiple copies; processing nodes; redundant nodes; simulation results; Distributed computing; Fault detection; Fault tolerance; Fault tolerant systems; Processor scheduling; Real time systems; Redundancy; Scheduling algorithm;
Conference_Titel :
Parallel and Distributed Real-Time Systems, 1995. Proceedings of the Third Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-8186-7099-1
DOI :
10.1109/WPDRTS.1995.470501