Title :
Antecedence graph based checkpointing and recovery for mobile agents
Author :
Kaur, Ramandeep ; Challa, Rama Krishna ; Singh, Rajwinder
Author_Institution :
Dept. of Comput. Sci., Nat. Inst. of Tech. Teachers' Training & Res., Chandigarh, India
Abstract :
Mobile agents are distributed programs that can move autonomously through a network to perform tasks on behalf of a user. Although mobile agents offer much more flexibility than client-server computing, they carry additional costs and raise issues such as security, reliability and fault tolerance, which must be addressed before mobile agent technology can be successfully adopted for developing real-life applications. Fault tolerance aims to provide reliable execution of agents even in the face of failures, which may arise from migration request failures, communication exceptions, system crashes or security violations. Graph-based fault tolerance protocols have been used successfully to implement fault tolerance in distributed computing. This paper proposes the use of antecedence graphs and message logs for maintaining the fault tolerance information of mobile agents. To reduce the overhead of carrying fault tolerance information in the form of large antecedence graphs, we propose the use of a parallel checkpointing algorithm. For checkpointing, dependent agents are identified using antecedence graphs, and only these agents are involved in the process of taking checkpoints. In case of failure, the antecedence graphs and message logs are regenerated for recovery, and normal operation then continues. Analysis of the results shows considerable improvement in terms of reduced message overhead, execution time and recovery time compared to the existing graph-based approach.
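The idea of checkpointing only the dependent agents can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes the antecedence graph is reduced to a plain mapping from each agent to the agents whose messages it has received, and that the agents to checkpoint are those the initiator transitively depends on. The function name `dependent_agents`, the agent labels, and the example graph are all hypothetical.

```python
from collections import deque


def dependent_agents(antecedents, initiator):
    """Return the initiator plus every agent it transitively depends on.

    `antecedents` maps an agent to the set of agents it has received
    messages from (an assumed, simplified view of an antecedence graph).
    """
    seen = {initiator}
    queue = deque([initiator])
    while queue:
        agent = queue.popleft()
        # Follow each "received a message from" edge once.
        for sender in antecedents.get(agent, ()):
            if sender not in seen:
                seen.add(sender)
                queue.append(sender)
    return seen


# Hypothetical example: A received from B, B received from C;
# D and E interact only with each other.
graph = {"A": {"B"}, "B": {"C"}, "C": set(), "D": {"E"}}

# Only A, B and C would take part in a checkpoint initiated by A.
print(sorted(dependent_agents(graph, "A")))  # ['A', 'B', 'C']
```

Under these assumptions, agents D and E are left undisturbed when A initiates a checkpoint, which is the kind of saving in message overhead the abstract refers to.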
Keywords :
checkpointing; distributed programming; fault tolerant computing; graph theory; mobile agents; antecedence graph; client-server computing; distributed computing; distributed program; graph based fault tolerance protocol; message log; mobile agent; parallel checkpointing algorithm; real life application; security violation; system crash; Barium; Checkpointing; Fault tolerance; Fault tolerant systems; Mobile agents; Multiagent systems; Servers; Antecedence graphs; Checkpointing; Message logs; Mobile agents; Reliability Failure; fault tolerance;
Conference_Title :
Communication Control and Computing Technologies (ICCCCT), 2010 IEEE International Conference on
Conference_Location :
Ramanathapuram
Print_ISBN :
978-1-4244-7769-2
DOI :
10.1109/ICCCCT.2010.5670588