DocumentCode :
3677995
Title :
Failure Prediction for Cloud Datacenter by Hybrid Message Pattern Learning
Author :
Yukihiro Watanabe;Hiroshi Otsuka;Yasuhide Matsumoto
Author_Institution :
Syst. Software Labs., FUJITSU Labs. Ltd., Kawasaki, Japan
fYear :
2014
Firstpage :
425
Lastpage :
432
Abstract :
In operations and management of large-scale cloud data enters, it is essential for administrators to handle failures occurring in their infrastructure before causing service-level violations. Some techniques for failure prediction have been studied because they can be used to start the troubleshooting process at the early stage of troubles and to prevent service-level violations from occurring. By its nature, however, failure prediction involves a certain amount of incorrect detection (false-positive). When applying failure prediction to the operation and management of cloud data centers, incorrect detection can result in the execution of unnecessary workaround tasks and additional costs. Existing methods for failure prediction using Bayesian inference to identify message patterns related to a certain failure are difficult to apply to relatively stable systems, because the accuracy of their predictions deteriorates in environments where failure rarely occurs. In order to solve this problem, we propose a novel method to improve the accuracy of failure prediction by suppressing incorrect detections using a hybrid score that integrates the probability of simultaneous occurrence between a message pattern and a failure and frequency of the message patterns for the failure. We implemented this method and evaluated the accuracy in a real commercial cloud data enter. The evaluation results revealed that it improved the accuracy of failure prediction by 31.9% compared with the existing method in terms of precision in the best case.
Keywords :
"Bayes methods","Accuracy","Servers","Feature extraction","Correlation","Conferences"
Publisher :
ieee
Conference_Titel :
Ubiquitous Intelligence and Computing, 2014 IEEE 11th Intl Conf on and IEEE 11th Intl Conf on and Autonomic and Trusted Computing, and IEEE 14th Intl Conf on Scalable Computing and Communications and Its Associated Workshops (UTC-ATC-ScalCom)
Type :
conf
DOI :
10.1109/UIC-ATC-ScalCom.2014.6
Filename :
7306986
Link To Document :
بازگشت