DocumentCode
475608
Title
Fault-Tolerance Mechanism of Computation Grid Service System Based on Mobile Agent
Author
Zhang, Zhirou ; Li, Ying
Author_Institution
Network & Inf. Center, North China Electr. Power Univ., Beijing
Volume
1
fYear
2008
fDate
3-4 Aug. 2008
Firstpage
161
Lastpage
165
Abstract
Constructing Computation Grid Service System with idle computers in an organization to provide computation service for Mobile Agent can save funds of high-performance computing and make full use of idle resources, but Fault-Tolerance mechanism must be researched to guarantee running of computation task when nodes or networks of the system fail. Three main parts of Fault-Tolerance mechanism of the system are researched in this paper. An adaptive Fault-Detection mechanism, a non-close, non-block and low-overhead Checkpointing mechanism, and a Partial Rollback Mechanism Based on Communication Domain are proposed, which can save overhead of Fault-Tolerance. Experiments have shown their advantages.
Keywords
fault tolerant computing; grid computing; mobile agents; checkpointing mechanism; computation grid service system; fault-tolerance mechanism; high-performance computing; mobile agent; partial rollback mechanism; Checkpointing; Communication system control; Computer architecture; Computer networks; Fault tolerance; Fault tolerant systems; Grid computing; High performance computing; Mobile agents; Mobile communication; Checkpointing; Computation Grid; Fault-Tolerance; Partial Rollback;
fLanguage
English
Publisher
ieee
Conference_Titel
Computing, Communication, Control, and Management, 2008. CCCM '08. ISECS International Colloquium on
Conference_Location
Guangzhou
Print_ISBN
978-0-7695-3290-5
Type
conf
DOI
10.1109/CCCM.2008.39
Filename
4609491
Link To Document