Title :
Research on Fault-tolerant Mechanism of Integrating Water-Domain Oriented Computing Resources
Author :
Shang, Ling ; Wang, Zhijian ; Zhang, Xiaohong ; Wang, Junjie ; Liu, Zhizhong
Author_Institution :
Coll. of Comput. & Inf. Eng., Hohai Univ., Nanjing, China
Abstract :
Water domain grid platform, a grid platform based on cycle stealing technology is used to harness idle computing resources in one or several labs of one or several sites for its low costs and high performance. Volatility is the key challenge of this kind of platform and one fault will generate when a computing node leaves the platform. So how to make these volatile nodes work together without being influenced by generated faults is a key issue. So this paper presents a fault tolerance architecture aiming at minimizing generating faults. Once faults generated, other idle computation nodes in this platform can go on executing the unfinished task immediately. Finally some experiments based on this platform show that the framework has good performance in dealing with fault tolerance in water domain oriented computing resources integrated platform.
Keywords :
fault tolerant computing; grid computing; resource allocation; cycle stealing technology; fault-tolerant mechanism; water domain grid platform; water-domain oriented computing resource; Concurrent computing; Costs; Fault tolerance; Grid computing; High performance computing; Internet; Laboratories; Resource management; Water conservation; Water resources; computing resources integrated; fault tolerance; matchmaking; volatile; water-domain oriented;
Conference_Titel :
Computer and Information Science, 2009. ICIS 2009. Eighth IEEE/ACIS International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3641-5
DOI :
10.1109/ICIS.2009.115