Title :
Parallel performance prediction for numerical codes in a multi-cluster environment
Author :
Romanazzi, Giuseppe ; Jimack, Peter K.
Author_Institution :
Sch. of Comput., Univ. of Leeds, Leeds
Abstract :
We propose a model for describing and predicting the performance of parallel numerical software on distributed memory architectures within a multi-cluster environment. The goal of the model is to allow reliable predictions to be made as to the execution time of a given code on a large number of processors of a given parallel system, and on a combination of systems, by only benchmarking the code on small numbers of processors. This has potential applications for the scheduling of jobs in a grid computing environment where informed decisions about which resources to use in order to maximize the performance and/or minimize the cost of a job will be valuable. The methodology is built and tested for a particular class of numerical code, based upon the multilevel solution of discretized partial differential equations, and despite its simplicity it is demonstrated to be extremely accurate and robust with respect to both the processor and communications architectures considered. Furthermore, results are also presented which demonstrate that excellent predictions may also be obtained for numerical algorithms that are more general than the pure multigrid solver used to motivate the methodology. These are based upon the use of a practical parallel engineering code that is briefly described. The potential significance of this work is illustrated via two scenarios which consider a Grid user who wishes to use the available resources either (i) to obtain a particular result as quickly as possible, or (ii) to obtain results to different levels of accuracy.
Keywords :
distributed memory systems; grid computing; scheduling; software performance evaluation; discretized partial differential equations; distributed memory architectures; grid computing environment; multi-cluster environment; numerical codes; parallel numerical software; parallel performance prediction; Clustering algorithms; Computational efficiency; Computer science; Concurrent computing; Grid computing; Hardware; Information technology; Predictive models; Robustness; Software performance; Cluster Computing; Grid Computing; Meta-Scheduling; Parallel Distributed Algorithms; Performance Evaluation and Prediction;
Conference_Titel :
Computer Science and Information Technology, 2008. IMCSIT 2008. International Multiconference on
Conference_Location :
Wisia
Print_ISBN :
978-83-60810-14-9
DOI :
10.1109/IMCSIT.2008.4747284