DocumentCode :
2784760
Title :
In Search of Real Data on Faults, Errors and Failures
Author :
Malek, Miroslaw
Author_Institution :
Humboldt-Universität zu Berlin, Germany
fYear :
2006
fDate :
18-20 Oct. 2006
Firstpage :
65
Lastpage :
65
Abstract :
Summary form only given. To make a relevant contribution to industrial practice and have an impact on future systems and services, it is essential that the research community has access to real data, so that it can test the effectiveness and verify the correctness of proposed techniques for enhanced availability. (The term "real data" refers to field data collected at customers' sites, not just in the lab, where experts usually assume various scenarios without proper attention to operator mistakes, the environment and the customer's maintenance procedures.) To date, the community has had only sporadic opportunities to access field data and has developed a body of knowledge that is frequently based on wrong assumptions, hypothetical failure models and simplistic distributions. At the core of the problem is that failure data are classified for competitive reasons, are almost always tied to specific customers, and may be enormous in bulk. With thousands of measurement points and up to about 1200 parameters that can be measured on computer and communication systems, the amount of data may range from several Gbytes to over a hundred Gbytes per day. The key challenge is how to filter out real data and code it so that the research community can access it, while at the same time significantly reducing the bulk of the data by focusing strictly on faults, errors and failures and their root causes. To change this state of affairs, the panel attempts to give pointers to sources of real data and to investigate ways of collecting the data and making it accessible to the research community. The panel includes academic and industrial experts.
Keywords :
error handling; software reliability; availability research; error analysis; failure analysis; fault analysis; real data access; Availability; Computer errors; Computer industry; Error correction; Filters; System testing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Dependable Computing Conference, 2006. EDCC '06. Sixth European
Conference_Location :
Coimbra
Print_ISBN :
0-7695-2648-9
Type :
conf
DOI :
10.1109/EDCC.2006.15
Filename :
4020832