Title :
Root Cause Analysis and Proactive Problem Prediction for Self-Healing
Author :
Piao, Shunshan ; Park, Jeongmin ; Lee, Eunseok
Author_Institution :
Sungkyunkwan Univ., Suwon
Abstract :
As the rapid evolvement of distributed computing system, the requirements imposed on problem determination techniques are increased to help system control and manage in high levels of automated ways, which represents the capability of self-healing. Many artificial intelligent approaches are widely used in the fields of fault managements. In this paper, we propose an approach to fault management for self-healing system through learning and analyzing real-time information, to provide both root cause analysis and proactive problem prediction. Using Bayesian network algorithm, we describe a complex system as a compact model that presents probabilistic dependency relationships between various factors in such a domain. We also provide an improved process that deals with collected parameters in advance, which enhances learning efficiency and reduces learning time. For estimating the efficiency and accuracy, an experimental demonstration based on system performance measurements is implemented and evaluated via diverse comparisons, which shows the availability is optimistic.
Keywords :
artificial intelligence; systems analysis; ubiquitous computing; Bayesian network algorithm; artificial intelligent approach; complex system; distributed computing system; fault management; proactive problem prediction; probabilistic dependency relationships; root cause analysis; self-healing system; Artificial intelligence; Automatic control; Availability; Bayesian methods; Control systems; Distributed computing; Information analysis; Learning; Real time systems; System performance;
Conference_Titel :
Convergence Information Technology, 2007. International Conference on
Conference_Location :
Gyeongju
Print_ISBN :
0-7695-3038-9
DOI :
10.1109/ICCIT.2007.231