Title :
An object-oriented approach for implementing algorithm-based fault tolerance
Author :
Acree, Richard K. ; Ullah, Nasr ; Karia, Adarsh ; Rahmeh, Joe T. ; Abraham, Jacob A.
Author_Institution :
Comput. Eng. Res. Center, Texas Univ., Austin, TX, USA
Abstract :
The authors demonstrate the practical use of an object-oriented system to incorporate fault tolerance and reliability into data objects. The object-based fault tolerance scheme uses abstraction to conceal algorithm-based fault tolerance layers. The scheme allows a layer of fault tolerance to be added to data objects without affecting or altering the use of the data objects. It is shown that the C++ class mechanisms of overloading and derivation permit the added fault tolerance to be transparent to the original data objects. To demonstrate the feasibility of this approach, using C++, a libray of matrix functions is presented and a layer of fault tolerance around matrix data objects is added. The weighted checksum code technique was implemented to create fault-tolerant matrix data objects. This allows programmers to add algorithm-based fault tolerance onto existing matrix applications without requiring modification to the original application. The implementation was experimentally evaluated using a software fault-injection tool, that emulated realistic hardware faults. An error coverage of over 96% was obtained with a memory overhead of 28%. The empirical results confirm the viability of the approach by demonstrating that object-based encapsulation is a valid method for transparently implementing algorithm-based fault tolerance
Keywords :
C language; fault tolerant computing; object-oriented programming; software fault tolerance; C++ class mechanisms; abstraction; algorithm-based fault tolerance; matrix functions; object-based encapsulation; object-oriented approach; reliability; software fault-injection tool; weighted checksum code technique; Application software; Costs; Data engineering; Fault tolerance; Fault tolerant systems; Hardware; Jacobian matrices; Operating systems; RNA; Reliability engineering;
Conference_Titel :
Computers and Communications, 1993., Twelfth Annual International Phoenix Conference on
Conference_Location :
Tempe, AZ
Print_ISBN :
0-7803-0922-7
DOI :
10.1109/PCCC.1993.344462