Title :
Semi-active replication of SNMP objects in agent groups applied for fault management
Author :
Duarte, Elias Procópio, Jr. ; Santos, Aldri L dos
Author_Institution :
Dept. Inf., Fed. Univ. of Parana, Brazil
Abstract :
It is often useful to examine management information base (MIB) objects of a faulty agent in order to determine why it is faulty. This paper presents a new framework for semi-active replication of SNMP management objects in local area networks. The framework is based on groups of agents that communicate with each other using reliable multicast. A group of agents provides fault-tolerant object functionality. An SNMP service is proposed that allows replicated MIB objects of a faulty agent of a given group to be accessed through fault-free agents of that group. The presented framework allows the dynamic definition of agent groups, and management objects to be replicated in each group. A practical fault-tolerant tool for local area network fault management was implemented and is presented. The system employs SNMP agents that interact with a group communication tool. As an example, we show how the examination of TCP-related objects of faulty agents have been used in the fault diagnosis process. The impact of replication on network performance is evaluated
Keywords :
computer network management; fault diagnosis; fault tolerant computing; local area networks; performance evaluation; replicated databases; software agents; telecommunication computing; telecommunication network reliability; transport protocols; LAN fault-tolerant tool; SNMP agents; SNMP management objects; SNMP service; TCP-related objects; agent groups; fault diagnosis; fault management; fault-free agents; fault-tolerant object; faulty agent; group communication tool; local area networks; management information base objects; management objects; network performance evaluation; reliable multicast; replicated MIB objects; semi-active replication; Computer crashes; Fault detection; Fault diagnosis; Fault tolerance; Informatics; Information management; Local area networks; Monitoring; Protocols; Telecommunication network reliability;
Conference_Titel :
Integrated Network Management Proceedings, 2001 IEEE/IFIP International Symposium on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-6719-7
DOI :
10.1109/INM.2001.918066