DocumentCode :
3015720
Title :
SPHINX: A Fault-Tolerant System for Scheduling in Dynamic Grid Environments
Author :
In, Jang-Uk ; Avery, Paul ; Cavanaugh, Richard ; Chitnis, Laukik ; Kulkarni, Mandar ; Ranka, Sanjay
Author_Institution :
Florida Univ., FL, USA
fYear :
2005
fDate :
04-08 April 2005
Abstract :
A grid consists of high-end computational, storage, and network resources that, while known a priori, are dynamic with respect to activity and availability. Efficient scheduling of requests to use grid resources must adapt to this dynamic environment while meeting administrative policies. In this paper, we describe a framework called SPHINX that can administer grid policies and schedule complex, data-intensive scientific applications. We present experimental results for several scheduling strategies that effectively utilize the monitoring and job-tracking information provided by SPHINX. These results demonstrate that SPHINX can schedule work effectively and in a fault-tolerant way across a large number of distributed clusters owned by multiple units in a virtual organization, in spite of the highly dynamic nature of the grid and complex policy issues. The novelty lies in the use of effective resource monitoring and job execution tracking to make scheduling decisions and provide fault tolerance, something that is missing in today’s grid environments.
Keywords :
fault tolerant computing; grid computing; resource allocation; scheduling; system monitoring; SPHINX framework; distributed clusters; fault-tolerant system; grid computing; grid resource scheduling; job-tracking information; resource monitoring; Application software; Availability; Collaborative software; Distributed computing; Dynamic scheduling; Fault tolerance; Fault tolerant systems; Grid computing; Processor scheduling; Resource management;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel and Distributed Processing Symposium, 2005. Proceedings. 19th IEEE International
Print_ISBN :
0-7695-2312-9
Type :
conf
DOI :
10.1109/IPDPS.2005.409
Filename :
1419828