Title :
Detecting failed processes using fault signatures
Author :
Avritzer, Alberto ; Weyuker, Elaine J.
Author_Institution :
AT&T Network Comput. Services, Red Hill, NJ, USA
Abstract :
A strategy is presented for automatically identifying processes that have failed. A determination is made of the types of data that need to be collected, and circumstances under which the approach is likely to be useful. The strategy is applied to generate signatures for three different types of workloads, and several different resources. A typical failure is injected into the process, and the associated signatures are presented for the same workloads and resources. A measure as defined that is used to determine whether or not a signature is likely to be indicative of a faulty process
Keywords :
software fault tolerance; failed processes detection; fault signatures; software fault tolerance; workloads; Application software; Computer networks; Costs; Degradation; Fault detection; Heart beat; Lakes; Monitoring; Signal processing; Software measurement;
Conference_Titel :
Computer Performance and Dependability Symposium, 1996., Proceedings of IEEE International
Conference_Location :
Urbana-Champaign, IL
Print_ISBN :
0-8186-7484-9
DOI :
10.1109/IPDS.1996.540231