DocumentCode :
2097119
Title :
Fault Tolerant Job Scheduling in Computational Grid
Author :
Nazir, Babar ; Khan, Taimoor
Author_Institution :
Dept. of Comput. Sci., COMSATS Inst. of Inf. Technol., Abbottabad
fYear :
2006
fDate :
13-14 Nov. 2006
Firstpage :
708
Lastpage :
713
Abstract :
In large-scale grids, the probability of a failure is much greater than in traditional parallel systems [1]. Fault tolerance has therefore become a crucial area in grid computing. In this paper, we address the problem of fault tolerance in terms of resource failure. We devise a strategy for fault-tolerant job scheduling in a computational grid. The proposed strategy maintains a history of the fault occurrences of each resource in the grid information service (GIS). Whenever a resource broker has a job to schedule, it retrieves the resource fault occurrence history from the GIS and, depending on this information, applies different intensities of checkpointing and replication when scheduling the job on resources that have different tendencies towards faults. Using checkpointing, the proposed scheme can make grid scheduling more reliable and efficient. Further, it increases the percentage of jobs executed within the specified deadline and allotted budget, helping to make the grid trustworthy. We have evaluated the performance of the proposed strategy through simulation. The experimental results demonstrate that the proposed strategy effectively schedules grid jobs in a fault-tolerant way in spite of the highly dynamic nature of the grid.
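The abstract does not give the actual policy, so the following is only a minimal sketch of the idea it describes: a broker reads a resource's fault history and picks a checkpointing intensity and a replica count from it. The names `ResourceHistory` and `plan_fault_tolerance`, the linear interval rule, and the default values are all assumptions made for illustration, not the authors' method.

```python
from dataclasses import dataclass


@dataclass
class ResourceHistory:
    """Hypothetical GIS record: jobs sent to a resource and how many of them failed."""
    name: str
    jobs_submitted: int
    jobs_failed: int

    @property
    def fault_rate(self) -> float:
        # Assumption: a resource with no history is treated as fault-free.
        if self.jobs_submitted == 0:
            return 0.0
        return self.jobs_failed / self.jobs_submitted


def plan_fault_tolerance(history, base_interval_s=600.0, max_replicas=3):
    """Return (checkpoint_interval_seconds, replica_count) for one resource.

    Assumed policy: the checkpoint interval shrinks linearly with the
    observed fault rate, and the number of replicas grows stepwise.
    """
    rate = history.fault_rate
    # Higher fault rate -> more frequent checkpoints; floor at 10% of the base interval.
    interval = base_interval_s * max(0.1, 1.0 - rate)
    # One copy on reliable resources, up to max_replicas on fault-prone ones.
    replicas = 1 + min(max_replicas - 1, int(rate * max_replicas))
    return interval, replicas


if __name__ == "__main__":
    for record in (ResourceHistory("reliable", 100, 0),
                   ResourceHistory("flaky", 100, 50)):
        print(record.name, plan_fault_tolerance(record))
```

Under these assumptions, a resource with no recorded faults keeps the base checkpoint interval and a single copy of the job, while a resource that has failed half of its jobs is checkpointed twice as often and receives one extra replica.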
Keywords :
checkpointing; grid computing; scheduling; software fault tolerance; check pointing; computational grid; fault tolerant job scheduling; grid computing; grid information service; Application software; Concurrent computing; Detectors; Distributed computing; Fault tolerance; Fault tolerant systems; Grid computing; History; Large-scale systems; Processor scheduling;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Emerging Technologies, 2006. ICET '06. International Conference on
Conference_Location :
Peshawar
Print_ISBN :
1-4244-0503-3
Electronic_ISBN :
1-4244-0503-3
Type :
conf
DOI :
10.1109/ICET.2006.335930
Filename :
4136898