DocumentCode :
2500005
Title :
Hierarchical adaptive distributed system-level diagnosis applied for SNMP-based network fault management
Author :
Duarte, Elias Procópio, Jr. ; Nanya, Takashi
Author_Institution :
Tokyo Inst. of Technol., Japan
fYear :
1996
fDate :
23-25 Oct 1996
Firstpage :
98
Lastpage :
107
Abstract :
Fault management is a key functional area of network management systems, but currently deployed applications often implement rudimentary diagnosis mechanisms. This paper presents a new hierarchical adaptive distributed system-level diagnosis (Hi-ADSD) algorithm and its implementation based on SNMP (simple network management protocol). Hi-ADSD is a fully distributed algorithm that has diagnosis latency of at most (log₂ N)² testing rounds for a network of N nodes. Nodes are mapped into progressively larger logical clusters, so that each node executes tests in a hierarchical fashion. The algorithm assumes no link faults, a fully-connected network and imposes no bounds on the number of faults. Both the worst-case diagnosis latency and correctness of the algorithm are formally proved. Experimental results are given through simulation of the algorithm for large networks. The algorithm was implemented on a small network using SNMP. We present details of the implementation, including device fault management, the role of the network management station, and the diagnosis management information base.
Keywords :
adaptive systems; computer network management; computer network reliability; distributed algorithms; fault diagnosis; hierarchical systems; local area networks; protocols; Hi-ADSD algorithm; LAN; distributed algorithm; hierarchical adaptive distributed system-level diagnosis; local area network; network fault management; network nodes; simple network management protocol; worst-case diagnosis latency; Adaptive systems; Clustering algorithms; Computer network management; Delay; Distributed algorithms; Fault diagnosis; Monitoring; Protocols; Technology management; Testing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Reliable Distributed Systems, 1996. Proceedings., 15th Symposium on
Conference_Location :
Niagara-on-the-Lake, Ont.
ISSN :
1060-9857
Print_ISBN :
0-8186-7481-4
Type :
conf
DOI :
10.1109/RELDIS.1996.559703
Filename :
559703