Title :
Simulation of software behavior under hardware faults
Author :
Goswami, Kumar K. ; Iyer, Ravishankar K.
Author_Institution :
Center for Reliable & High-Performance Comput., Illinois Univ., Urbana, IL, USA
Abstract :
A simulation-based software-model that permits application specific dependability analysis in the early design stages is introduced. The model represents an application program by decomposing it into a graph model consisting of a set of nodes, a set of edges that probabilistically determine the flow from node to node, and a mapping of the nodes to memory. The software model simulates the execution of the program while errors are injected into the program´s memory space. The model provides application-dependent parameters such as detection and propagation times and permits evaluation of function on system level error detection and recovery schemes. A case study illustrates the interaction between an application program and two detection schemes. Specifically, Gaussian elimination programs running on a Tandem Integrity S2 system with memory scrubbing are studied. Results from the simulation-based software model are validated with data measured from an actual Tandem Integrity S2 system. Application dependent coverage values obtained with the model are compared with those obtained via traditional schemes that assume uniform or ramp memory access patterns. For the authors´ program, some coverage values obtained with the traditional approaches were found to be 100% larger than those obtained with the software model.
Keywords :
software fault tolerance; Gaussian elimination programs; Tandem Integrity S2 system; application program; application specific dependability analysis; application-dependent parameters; graph model; hardware faults; memory scrubbing; ramp memory access patterns; simulation-based software-model; software behaviour simulation; software model; Analytical models; Application software; Availability; Computational modeling; Delay; Hardware; NASA; Random access memory; Software reliability; Software systems;
Conference_Titel :
Fault-Tolerant Computing, 1993. FTCS-23. Digest of Papers., The Twenty-Third International Symposium on
Conference_Location :
Toulouse, France
Print_ISBN :
0-8186-3680-7
DOI :
10.1109/FTCS.1993.627325