Title :
System Failure Prediction through Rare-Events Elastic-Net Logistic Regression
Author :
Navarro, Jose M. ; Parada, G. Hugo A. ; Duenas, Juan C.
Author_Institution :
Center for Open Middleware, Univ. Politec. de Madrid, Madrid, Spain
Abstract :
Predicting failures in a distributed system based on previous events through logistic regression is a standard approach in literature. This technique is not reliable, though, in two situations: in the prediction of rare events, which do not appear in enough proportion for the algorithm to capture, and in environments where there are too many variables, as logistic regression tends to over fit on this situations, while manually selecting a subset of variables to create the model is error-prone. On this paper, we solve an industrial research case that presented this situation with a combination of elastic net logistic regression, a method that allows us to automatically select useful variables, a process of cross-validation on top of it and the application of a rare events prediction technique to reduce computation time. This process provides two layers of cross-validation that automatically obtain the optimal model complexity and the optimal model parameters values, while ensuring even rare events will be correctly predicted with a low amount of training instances. We tested this method against real industrial data, obtaining a total of 60 out of 80 possible models with a 90% average model accuracy.
Keywords :
computational complexity; distributed processing; failure analysis; feature selection; regression analysis; computation time; cross-validation; distributed system; elastic net logistic regression; industrial research case; optimal model complexity; optimal model parameters value; predicting failure; rare events prediction technique; rare-events elastic-net logistic regression; system failure prediction; Complexity theory; Computational modeling; Data models; Logistics; Prediction algorithms; Predictive models; Training; Automatic Feature Selection; Logistic Regression; Machine Learning; Multivariable Prediction; Online Failure Prediction; System Management;
Conference_Titel :
Artificial Intelligence, Modelling and Simulation (AIMS), 2014 2nd International Conference on
Print_ISBN :
978-1-4799-7599-0
DOI :
10.1109/AIMS.2014.19