Title :
Fast probabilistic fault diagnosis for large scale system
Author :
Chu, L.W. ; Zou, S.H. ; Cheng, S.D. ; Wang, W.D. ; Tian, C.Q.
Author_Institution :
State Key Lab. of Networking & Switching Technol., Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
The challenges of fast fault diagnosis for large scale service systems are analyzed in this paper. A multi-layer management model is proposed to model the service scenario, which builds bipartite Bayesian network to denote dependence relationships. An incremental fault belief assessment method is proposed to analyze symptoms and compute posterior fault probabilities in an event-driven manner. Based on the method, we propose a greedy fault diagnosis algorithm to produce a sub-optimal explanation. To reduce the complexity of fault selection, we transform the fault diagnosis problem of finding MPE into finding most likely assignment of each fault, and propose corresponding algorithm. Simulation results prove the validity and efficiency of our algorithms.
Keywords :
belief networks; fault diagnosis; greedy algorithms; large-scale systems; bipartite Bayesian network; dependence relationship; fast probabilistic fault diagnosis; greedy fault diagnosis algorithm; incremental fault belief assessment method; large scale service system; multilayer management model; posterior fault probability; Bayesian methods; Computer network management; Computer science; Fault diagnosis; Laboratories; Large-scale systems; Quality of service; Telecommunication switching; Web and internet services; Web services; dependency model; event-driven; fault diagnosis; probabilistic diagnosis;
Conference_Titel :
Computer Science and Information Technology, 2009. ICCSIT 2009. 2nd IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-4519-6
Electronic_ISBN :
978-1-4244-4520-2
DOI :
10.1109/ICCSIT.2009.5234766