DocumentCode
244397
Title
What Logs Should You Look at When an Application Fails? Insights from an Industrial Case Study
Author
Cinque, Marcello ; Cotroneo, Domenico ; Della Corte, Raffaele ; Pecchia, Antonio
Author_Institution
Dipt. di Ing. Elettr. e Tecnol. dell´Inf., Univ. degli Studi di Napoli Federico II, Naples, Italy
fYear
2014
fDate
23-26 June 2014
Firstpage
690
Lastpage
695
Abstract
Event logs are the first place where to find useful information about application failures. Event logs are available at different system levels, such as application, middleware and operating system. In this paper we analyze the failure reporting capability of event logs collected at different levels of an industrial system in the Air Traffic Control (ATC) domain. The study is based on a data set of 3,159 failures induced in the system by means of software fault injection. Results indicate that the reporting ability of event logs collected at a given level is strongly affected by the type of failure observed at runtime. For example, even if operating system logs catch almost all application crashes, they are strongly ineffective in face of silent and erratic failures in the considered system.
Keywords
air traffic control; middleware; operating systems (computers); software fault tolerance; system monitoring; ATC domain; air traffic control domain; application failures; event logs; failure reporting capability; industrial system; middleware; operating system logs; software fault injection; Computer crashes; Failure analysis; Logistics; Middleware; Operating systems; Radio frequency; air traffic control; event log; failure analysis; fault injection; middleware;
fLanguage
English
Publisher
ieee
Conference_Titel
Dependable Systems and Networks (DSN), 2014 44th Annual IEEE/IFIP International Conference on
Conference_Location
Atlanta, GA
Type
conf
DOI
10.1109/DSN.2014.69
Filename
6903626
Link To Document