Title :
A Fault-Tolerant Voting Scheme for Multithreaded Environments
Author :
Fechner, Bernhard ; Keller, Jörg
Author_Institution :
FernUniversität Hagen
Abstract :
Voting schemes are widely used in fault-tolerant systems, mainly in systems that employ temporal or component redundancy. We present a voting scheme for multithreaded environments. It is based on the observation that a fault-tolerant system which does not know its history cannot distinguish between transient errors (SEUs) and permanent errors caused by the use of a faulty component. The history of errors is used to predict future errors and to determine whether a permanent or a transient error occurred. Only in the former case is a repair necessary; in the latter case recovery is sufficient. Using prediction and credibility points, we are able to tell whether a system failure is likely to occur soon. The more credibility a version has, the more likely it is to compute a correct result. Therefore, we can use credibility points in connection with thread prioritization to increase performance.
Keywords :
Fault tolerance; Fault tolerant systems; Hardware; History; Multithreading; Parallel processing; Redundancy; Single event transient; Voting; Yarn;
Conference_Title :
Parallel Computing in Electrical Engineering, 2004. PARELEC 2004. International Conference on
Print_ISBN :
0-7695-2080-4
DOI :
10.1109/PCEE.2004.2