Title :
System log pre-processing to improve failure prediction
Author :
Zheng, Ziming ; Lan, Zhiling ; Park, Byung H. ; Geist, Al
Author_Institution :
Illinois Inst. of Technol., Chicago, TN, USA
fDate :
June 29 2009-July 2 2009
Abstract :
Log preprocessing, a process applied on the raw log before applying a predictive method, is of paramount importance to failure prediction and diagnosis. While existing filtering methods have demonstrated good compression rate, they fail to preserve important failure patterns that are crucial for failure analysis. To address the problem, in this paper we present a log preprocessing method. It consists of three integrated steps: (1) event categorization to uniformly classify system events and identify fatal events; (2) event filtering to remove temporal and spatial redundant records, while also preserving necessary failure patterns for failure analysis; (3) causality-related filtering to combine correlated events for filtering through apriori association rule mining. We demonstrate the effectiveness of our preprocessing method by using real failure logs collected from the Cray XT4 at ORNL and the Blue Gene/L system at SDSC. Experiments show that our method can preserve more failure patterns for failure analysis, thereby improving failure prediction by up to 174%.
Keywords :
data mining; fault tolerant computing; apriori association rule mining; causality-related filtering; event categorization; event filtering; failure analysis; failure logs; failure pattern; failure prediction; fatal event identification; spatial redundant records; system event classification; system log preprocessing; temporal redundant records; Association rules; Data analysis; Data mining; Failure analysis; Filtering; Information resources; Laboratories; Large-scale systems; Production systems; Productivity; Cray XT4; IBM Blue Gene/L; event categorization; event filtering; log preprocessing;
Conference_Titel :
Dependable Systems & Networks, 2009. DSN '09. IEEE/IFIP International Conference on
Conference_Location :
Lisbon
Print_ISBN :
978-1-4244-4422-9
Electronic_ISBN :
978-1-4244-4421-2
DOI :
10.1109/DSN.2009.5270289