DocumentCode :
1914030
Title :
End-to-end service failure diagnosis using belief networks
Author :
Steinder, M. ; Sethi, A.S.
Author_Institution :
Dept. of Comput. & Inf. Sci., Delaware Univ., Newark, DE, USA
fYear :
2002
fDate :
2002
Firstpage :
375
Lastpage :
390
Abstract :
We present fault localization techniques suitable for diagnosing end-to-end service problems in communication systems with complex topologies. We refine a layered system model that represents relationships between services and functions offered between neighboring protocol layers. In a given layer, an end-to-end service between two hosts may be provided using multiple host-to-host services offered in this layer between two hosts on the end-to-end path. Relationships among end-to-end and host-to-host services form a bipartite probabilistic dependency graph whose structure depends on the network topology in the corresponding protocol layer. When an end-to-end service fails or experiences performance problems it is important to efficiently find the responsible host-to-host services. Finding the most probable explanation (MPE) of the observed symptoms is NP-hard. We propose two fault localization techniques based on Pearl´s (1988) iterative algorithms for singly connected belief networks. The probabilistic dependency graph is transformed into a belief network, and then the approximations based on Pearl´s algorithms and exact bucket tree elimination algorithm are designed and evaluated through extensive simulation study.
Keywords :
belief networks; fault location; graph theory; iterative methods; network topology; probability; protocols; telecommunication network management; telecommunication network reliability; Pearl´s iterative algorithms; QoS guarantees; belief networks; bipartite probabilistic dependency graph; end-to-end service failure diagnosis; exact bucket tree elimination algorithm; fault localization techniques; host-to-host services; layered dependency graph; layered system model; most probable explanation; network availability; network fault management; network topology; nondeterministic reasoning; protocol layers; simulation; Availability; Collaboration; Computer networks; Fault diagnosis; Inference algorithms; Iterative algorithms; Network topology; Protocols; Quality of service; Uncertainty;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Network Operations and Management Symposium, 2002. NOMS 2002. 2002 IEEE/IFIP
Print_ISBN :
0-7803-7382-0
Type :
conf
DOI :
10.1109/NOMS.2002.1015595
Filename :
1015595
Link To Document :
بازگشت