Title :
Hardware/software fault tolerance with multiple task modular redundancy
Author :
Fuhrman, Christopher P. ; Chutani, Sailesh ; Nussbaumer, Henri J.
Author_Institution :
Dept. of Comput. Sci., Swiss Federal Inst. of Technol., Lausanne, Switzerland
Abstract :
N-modular redundancy (NMR) and N-version programming (NVP) are two popular fault tolerance techniques in which hardware and software redundancy is exploited to mask faults. Redundant hardware is used to improve fault tolerance rather than throughput. We introduce a scheme for combined hardware-software fault tolerance derived from NMR and NVP that shows how redundancy can also be used to improve throughput by grouping the execution of several tasks. Our scheme uses a dynamic task allocation algorithm with an optimistic execution policy where the number of task executions is kept close to the minimum required to produce fault-free results. For equivalent hardware and software resources, the proposed method is 50% to 100% more efficient in terms of throughput and latency.
Keywords :
fault tolerant computing; modules; programming; redundancy; software fault tolerance; N-modular redundancy; N-version programming; dynamic task allocation algorithm; fault-free results; hardware redundancy; hardware resources; hardware-software fault tolerance; hardware/software fault tolerance; latency; multiple task modular redundancy; optimistic execution policy; redundant hardware; software redundancy; software resources; task executions; throughput; Delay; Fault detection; Fault tolerance; Fault tolerant systems; Hardware; Nuclear magnetic resonance; Redundancy; Testing; Throughput; Voting;
Conference_Titel :
Computers and Communications, 1995. Proceedings., IEEE Symposium on
Conference_Location :
Alexandria, Egypt
Print_ISBN :
0-8186-7075-4
DOI :
10.1109/SCAC.1995.523663