Title : 
Effects of resource utilization monitoring in fault recovery
         
        
            Author : 
Sarnaik, T.R. ; Somani, Arun K.
         
        
            Author_Institution : 
Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
         
        
        
        
        
        
            Abstract : 
We develop a technique to reduce the time to perform online tests and diagnose faulty units and reduce the recovery time when a fault occurs in a system. A clue is given to the tester about possible faulty locations. Thus only a fraction of the resources within a system needs to be tested. This is accomplished by keeping track of the resources used by an application program when it executes. We demonstrate that a significant reduction in test time can be achieved, in particular for cache and memory subsystems. This technique can improve response time and meet more deadlines in soft real-time systems when the system employs online tests and recovery schemes. We develop this technique further and support our analysis using trace-driven simulation. We discuss ways to implement the resource utilization vector (RUV) scheme in a system, and show how the RUV scheme is used to improve the forward error recovery process.<>
         
        
            Keywords : 
buffer storage; computer testing; fault location; fault tolerant computing; online operation; real-time systems; resource allocation; system monitoring; system recovery; vectors; acceptance test; application program execution; cache subsystems; deadlines; fault recovery; faulty locations; faulty unit diagnosis; forward error recovery process; memory subsystems; online tests; program behavior; recovery time; resource utilization monitoring; resource utilization vector scheme; response time; soft real-time systems; test time reduction; trace-driven simulation; Analytical models; Cache memory; Computer science; Computerized monitoring; Delay; Error correction codes; Hardware; Real time systems; Resource management; System testing;
         
        
        
        
            Conference_Titel : 
Fault-Tolerant Computing, 1994. FTCS-24. Digest of Papers., Twenty-Fourth International Symposium on
         
        
            Conference_Location : 
Austin, TX, USA
         
        
            Print_ISBN : 
0-8186-5520-8
         
        
        
            DOI : 
10.1109/FTCS.1994.315662