DocumentCode :
2253573
Title :
Whither generic recovery from application faults? A fault study using open-source software
Author :
Chandra, Subhachandra ; Chen, Peter M.
Author_Institution :
Div. of Comput. Sci. & Eng., Michigan Univ., Ann Arbor, MI, USA
fYear :
2000
fDate :
2000
Firstpage :
97
Lastpage :
106
Abstract :
We test the hypothesis that generic recovery techniques, such as process pairs, can survive most application faults without using application-specific information. We examine in detail the faults that occur in three, large, open-source applications: the Apache Web server, the GNOME desktop environment and the MySQL database. Using information contained in the bug reports and source code, we classify faults based on how they depend on the operating environment. We find that 72-87% of the faults are independent of the operating environment and are hence deterministic (non-transient). Recovering from the failures caused by these faults requires the use of application-specific knowledge. Half of the remaining faults depend on a condition in the operating environment that is likely to persist on retry, and the failures caused by these faults are also likely to require application-specific recovery. Unfortunately, only 5-14% of the faults were triggered by transient conditions, such as timing and synchronization, that naturally fix themselves during recovery. Our results indicate that classical application-generic recovery techniques, such as process pairs, will not be sufficient to enable applications to survive most failures caused by application faults
Keywords :
relational databases; search engines; software fault tolerance; system recovery; Apache Web server; GNOME desktop environment; MySQL database; application fault recovery; application-specific recovery; generic recovery techniques; open-source software; process pairs; synchronization; timing; Application software; Computer bugs; Computer errors; Computer science; Fault tolerance; Open source software; Programming profession; Redundancy; Software systems; System software;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Dependable Systems and Networks, 2000. DSN 2000. Proceedings International Conference on
Conference_Location :
New York, NY
Print_ISBN :
0-7695-0707-7
Type :
conf
DOI :
10.1109/ICDSN.2000.857521
Filename :
857521
Link To Document :
بازگشت