Title :
A framework for distributed fault management using intelligent software agents
Author :
Ekaette, Edidiong Uyai ; Far, Behrouz Homayoun
Author_Institution :
Dept. of Electr. & Comput. Eng., Calgary Univ., Alta., Canada
Abstract :
This paper proposes a framework for distributed management of network faults by software agents. Intelligent network agents with advanced reasoning capabilities address many of the issues raised by distributing processing and control in network management. The agents detect and correlate alarms generated in their domain and selectively seek a clear explanation for them. The causal relationship between faults and their effects is represented as a Bayesian network. As evidence (alarms) is gathered, the probability of the presence of any particular fault is strengthened or weakened. Agents with a narrower view of the network forward their findings to another agent with a much broader view of the network. Depending on the network's degree of automation, an agent can carry out local recovery actions. A prototype reflecting the ideas discussed in this paper is under implementation.
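The way gathered alarms strengthen or weaken belief in a fault can be illustrated with a minimal sketch. It is not the authors' model: the abstract gives neither the structure nor the probabilities of their Bayesian network, so the sketch assumes a single fault node whose alarms are conditionally independent given the fault state. The function name, the alarm names, and every number are hypothetical.

```python
def update_fault_belief(prior, likelihoods, alarms):
    """Return P(fault | alarms) for one fault node.

    Assumption (not from the paper): alarms are conditionally
    independent given whether the fault is present.

    prior       -- P(fault) before any evidence
    likelihoods -- {alarm: (P(alarm | fault), P(alarm | no fault))}
    alarms      -- {alarm: True if raised, False if observed absent}
    """
    p_fault = prior
    p_ok = 1.0 - prior
    for name, raised in alarms.items():
        p_given_fault, p_given_ok = likelihoods[name]
        if raised:
            p_fault *= p_given_fault
            p_ok *= p_given_ok
        else:
            p_fault *= 1.0 - p_given_fault
            p_ok *= 1.0 - p_given_ok
    # Normalize so the two hypotheses sum to one.
    return p_fault / (p_fault + p_ok)


# Hypothetical numbers for a "link failure" fault and two alarms.
PRIOR = 0.01
LIKELIHOODS = {
    "link_down": (0.90, 0.05),
    "high_latency": (0.70, 0.20),
}

# A raised alarm strengthens the belief in the fault ...
after_one = update_fault_belief(PRIOR, LIKELIHOODS, {"link_down": True})
# ... and an expected alarm that stays silent weakens it again.
after_two = update_fault_belief(
    PRIOR, LIKELIHOODS, {"link_down": True, "high_latency": False}
)
```

In a fuller model each agent would hold a network of many fault and alarm nodes and forward its posterior to the agent with the broader view; the single-node case only shows the direction in which evidence moves the probability.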
Keywords :
Internet; belief networks; computer network management; computer network reliability; fault tolerant computing; protocols; software agents; Bayesian network; agent detection; alarm correlation; distributed network fault management; distributed processing; intelligent network agent; local recovery action; network automation; network management control; reasoning capability; simple network management protocol; software agent; Bayesian methods; Engineering management; Fault detection; IP networks; Information management; Intelligent agent; Intelligent networks; Robustness
Conference_Title :
Electrical and Computer Engineering, 2003. IEEE CCECE 2003. Canadian Conference on
Print_ISBN :
0-7803-7781-8
DOI :
10.1109/CCECE.2003.1226015