Title :
Rare conditions - an important cause of failures
Author_Institution :
SoHaR Inc., Beverly Hills, CA, USA
Abstract :
Data gathered by the author, as well as data published by others, show that during the early operational period rarely executed code has a much higher failure rate, expressed in terms of execution time, than frequently executed code. More detailed analysis of the data shows that the inability to handle multiple rare conditions, such as responses to hardware failures or exception conditions caused by the computer state, is a prominent cause of program failure in well-tested systems. Approaches to coping with this difficulty are discussed. We investigated differences in failure rate between rarely executed code (RC) and frequently executed code (FC) in the NASA/JPL Deep Space Network (DSN).
Keywords :
aerospace computing; fault tolerant computing; redundancy; software reliability; NASA/JPL Deep Space Network; computer state; data analysis; exception conditions; execution time; failure rate; frequently executed code; hardware failure; multiple rare conditions; program failure; rarely executed code; software failure; well-tested systems; Failure analysis; Fault diagnosis; Frequency estimation; Hardware; Information retrieval; Space vehicles; Telemetry; Testing
Conference_Title :
Computer Assurance, 1993. COMPASS '93, Practical Paths to Assurance. Proceedings of the Eighth Annual Conference on
Conference_Location :
Gaithersburg, MD
Print_ISBN :
0-7803-1251-1
DOI :
10.1109/CMPASS.1993.288855