DocumentCode :
652513
Title :
Resource Management Architecture for Fair Scheduling of Optional Computations
Author :
Camillo, Frederic ; Caron, Eddy ; Guivarch, Ronan ; Hurault, Aurelie ; Klein, Cristian ; Perez, C.
Author_Institution :
INPT, Univ. of Toulouse, Toulouse, France
fYear :
2013
fDate :
28-30 Oct. 2013
Firstpage :
113
Lastpage :
120
Abstract :
Most High-Performance Computing platforms require users to submit a pre-determined number of computation requests (also called jobs). Unfortunately, this is cumbersome when some of the computations are optional, i.e., they are not critical, but their completion would improve results. For example, given a deadline, the number of requests to submit for a Monte Carlo experiment is difficult to choose. The more requests are completed, the better the results are, however, submitting too many might overload the platform. Conversely, submitting too few requests may leave resources unused and misses an opportunity to improve the results. This paper introduces and solves the problem of scheduling optional computations. It proposes a generic client-server architecture and an implementation in a production GridRPC middleware, which auto-tunes the number of requests. Real-life experiments show that several metrics are improved, such as user satisfaction, fairness and the number of completed requests. Moreover, the solution is shown to be scalable.
Keywords :
grid computing; middleware; parallel architectures; remote procedure calls; resource allocation; scheduling; autotuning; generic client-server architecture; high-performance computing platform; optional computation scheduling; production GridRPC middleware; remote procedure call paradigm; resource management architecture; Aging; Bismuth; Cloud computing; Three-dimensional displays; HPC; optional computations; resource management; scheduling; uncertainty analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC), 2013 Eighth International Conference on
Conference_Location :
Compiegne
Type :
conf
DOI :
10.1109/3PGCIC.2013.23
Filename :
6681217
Link To Document :
بازگشت