DocumentCode :
1661765
Title :
A new approach to configurable dynamic scheduling in clusters based on single system image technologies
Author :
Vallée, Geoffroy ; Morin, Christine ; Berthou, Jean-Yves ; Rilling, Louis
Author_Institution :
R&D, Electricite de France, Clamart, France
fYear :
2003
Abstract :
Clusters are now considered as an alternative to parallel machines to execute workloads made up of sequential and/or parallel applications. For efficient application execution on clusters, dynamic global process scheduling is of prime importance. Different dynamic scheduling policies that have been studied for distributed systems or parallel machines may be used in clusters. The choice of a particular policy depends on the kind of workload to be executed. In a cluster, it is thus highly desirable to implement a configurable global scheduler to be able to adapt the dynamic scheduling policy to the workload characteristics, to take benefit of all cluster resources and to cope with node shutdown and reboot. In this paper, we present the architecture of the global scheduler and the process management mechanisms of Kerrighed, a single system image operating system designed for high performance computing on clusters. Kerrighed provides a development framework allowing to easily implement dynamic scheduling policies without kernel modification. In Kerrighed, the global scheduling policy can be dynamically changed while applications execute on the cluster Kerrighed´s process management mechanisms allow to easily deploy parallel applications in the cluster and to efficiently migrate or checkpoint processes, including processes sharing memory. Kerrighed has been implemented as a set of modules extending Linux kernel. Preliminary performance results are presented.
Keywords :
network operating systems; workstation clusters; Kerrighed; Linux kernel; cluster computing; configurable dynamic scheduling; configurable global scheduler; dynamic global process scheduling; high performance computing; process management mechanisms; single system image operating system; single system image technologies; Computer architecture; Dynamic scheduling; High performance computing; Kernel; Linux; Memory management; Operating systems; Parallel machines; Processor scheduling; Research and development;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium, 2003. Proceedings. International
ISSN :
1530-2075
Print_ISBN :
0-7695-1926-1
Type :
conf
DOI :
10.1109/IPDPS.2003.1213198
Filename :
1213198
Link To Document :
بازگشت