Title : 
Lightweight Fault-Tolerance Mechanism for Distributed Mobile Agent-Based Monitoring
         
        
        
            Author_Institution : 
Dept. of Comput. Sci., Kyonggi Univ., Suwon
         
        
        
        
        
        
            Abstract : 
Thanks to asynchronous and dynamic natures of mobile agents, a certain number of mobile agent-based monitoring mechanisms have actively been developed to monitor large scale and dynamic distributed networked systems adaptively and efficiently. Among them, some mechanisms attempt to adapt to dynamic changes in various aspects such as network traffic patterns, resource addition and deletion, network topology and so on. However, failures of some domain managers are very critical to providing correct, real-time and efficient monitoring functionality in a large-scale mobile agent-based distributed monitoring system. In this paper, we present a novel fault- tolerance mechanism to have the following advantageous features appropriate for large-scale and dynamic hierarchical mobile agent-based monitoring organizations. It supports fast failure detection functionality with low failure-free overhead by each domain manager transmitting heart-beat messages to its immediate higher-level manager. Also, it minimizes the number of non-faulty monitoring managers affected by failures of domain managers. Moreover, it allows consistent failure detection actions to be performed continuously in case of agent creation, migration and termination, and is able to execute consistent takeover actions even in concurrent failures of domain managers.
         
        
            Keywords : 
failure analysis; fault tolerant computing; mobile agents; system monitoring; distributed mobile agent-based monitoring; dynamic distributed network system; failure detection; fault-tolerance mechanism; heart-beat message transmission; Computerized monitoring; Condition monitoring; Fault tolerance; Information filtering; Information filters; Large-scale systems; Mobile agents; Mobile computing; Resource management; Telecommunication traffic;
         
        
        
        
            Conference_Titel : 
Consumer Communications and Networking Conference, 2009. CCNC 2009. 6th IEEE
         
        
            Conference_Location : 
Las Vegas, NV
         
        
            Print_ISBN : 
978-1-4244-2308-8
         
        
            Electronic_ISBN : 
978-1-4244-2309-5
         
        
        
            DOI : 
10.1109/CCNC.2009.4784958