Title :
Definition and analysis of hardware- and software-fault-tolerant architectures
Author :
Laprie, Jean-Claude ; Arlat, Jean ; Béounes, Christian ; Kanoun, Karama
Author_Institution :
LAAS-CNRS, Toulouse, France
Date :
7/1/1990
Abstract :
A structured definition of hardware- and software-fault-tolerant architectures is presented. Software-fault-tolerance methods are discussed, resulting in definitions for soft and solid faults. A soft software fault has a negligible likelihood of recurrence and is recoverable, whereas a solid software fault is recurrent under normal operations or cannot be recovered. A set of hardware- and software-fault-tolerant architectures is presented, and three of them are analyzed and evaluated. Architectures tolerating a single fault and architectures tolerating two consecutive faults are discussed separately. A sidebar addresses the cost issues related to software fault tolerance. The approach taken throughout is as general as possible, dealing with specific classes of faults or techniques only when necessary.
Keywords :
computer architecture; fault tolerant computing; cost issues; hardware-fault-tolerant architectures; software-fault-tolerant architectures; Automatic testing; Computer architecture; Computer errors; Fault detection; Hardware; Parallel programming; Sequential analysis; Software safety; Software testing; System testing;