Title :
Checkpointing and rollback of wide-area distributed applications using mobile agents
Author :
Cao, Jiannong ; Chan, G.H. ; Jia, Weijia ; Dillon, Tharam S.
Author_Institution :
Internet Computing and E-Commerce Laboratory, Hong Kong Polytechnic University, Kowloon, China
Abstract :
We consider the problem of designing rollback error recovery algorithms for dynamic, wide-area distributed systems such as the Internet. The characteristics and scale of such a system complicate both the design and the performance of these algorithms. In a wide-area environment, traditional message-passing-based algorithms incur large overhead in both network traffic and message-passing delay. In this paper, we propose a novel approach to designing checkpointing and rollback algorithms that uses mobile agents as an aid. Using mobile agents reduces the total amount of communication and allows us to design algorithms that take advantage of the most up-to-date system information for decision making. It also allows us to develop algorithms that implement flexible and adaptive policies. We propose a mobile-agent-enabled hybrid algorithm that combines independent and coordinated checkpointing. A prototype of the algorithms has been developed using IBM's Aglets. Results of the performance evaluation are presented and discussed.
Keywords :
Internet; message passing; performance evaluation; software agents; system recovery; IBM's Aglets; checkpointing; message passing based algorithms; message passing delay; mobile agents; network traffic; performance; rollback; rollback error recovery algorithms; wide-area distributed applications; Algorithm design and analysis; Decision making; Heuristic algorithms; Mobile communication; Prototypes; Telecommunication traffic
Conference_Title :
Proceedings 15th International Parallel and Distributed Processing Symposium
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7695-0990-8
DOI :
10.1109/IPDPS.2001.924943