DocumentCode :
588640
Title :
The Nature of the Times to Flight Software Failure during Space Missions
Author :
Alonso, J. Marcos ; Grottke, Michael ; Nikora, Allen P. ; Trivedi, Kishor S.
fYear :
2012
fDate :
27-30 Nov. 2012
Firstpage :
331
Lastpage :
340
Abstract :
The growing complexity of mission-critical space mission software makes it prone to suffer failures during operations. The success of space missions depends on the ability of the systems to deal with software failures, or to avoid them in the first place. In order to develop more effective mitigation techniques, it is necessary to understand the nature of the failures and the underlying software faults. Based on their characteristics, software faults can be classified into Bohrbugs, non-aging-related Mandelbugs, and aging-related bugs. Each type of fault requires different kinds of mitigation techniques. While Bohrbugs are usually easy to fix during development or testing, this is not the case for non-aging-related Mandelbugs and aging-related bugs due to their inherent complexity. Systems need mechanisms like software restart, software replication or software rejuvenation to deal with failures caused by these faults during the operational phase. In a previous study, we classified space mission flight software faults into the three above-mentioned categories based on problems reported during operations. That study concentrated on the percentages of the faults of each type and the variation of these percentages within and across different missions. This paper extends that work by exploring the nature of the times to software failure due to Bohrbugs and non-aging-related Mandelbugs for eight JPL/NASA missions. We start by applying trend tests to the times to failure to check if there is any reliability growth (or decay) for each type of failure. For those times to failure sequences with no trend, we fit distributions to the data sets and carry out goodness-of-fit tests. The results will be used to guide the development of improved operational failure mitigation techniques, thereby increasing the reliability of space mission software.
Keywords :
aerospace computing; program debugging; program testing; software fault tolerance; software metrics; Bohrbugs; JPL mission; NASA mission; aging-related bugs; goodness-of-fit tests; growing complexity; mission-critical space mission software; non-aging-related Mandelbugs; nonaging-related Mandelbugs; operational failure mitigation techniques; reliability growth; software rejuvenation; software replication; software restart; space mission flight software faults; space mission software reliability; space missions; times to failure sequences; times to flight software failure; trend tests; Computer bugs; Engines; Market research; Software; Software reliability; Space missions; Flight software; failure reports; goodness-of-fit; reliability growth; times to failure;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Reliability Engineering (ISSRE), 2012 IEEE 23rd International Symposium on
Conference_Location :
Dallas, TX
ISSN :
1071-9458
Print_ISBN :
978-1-4673-4638-2
Type :
conf
DOI :
10.1109/ISSRE.2012.32
Filename :
6405381
Link To Document :
بازگشت