DocumentCode
3048581
Title
Communication benchmarking and performance modelling of MPI programs on cluster computers
Author
Grove, D.A.; Coddington, P.D.
Author_Institution
Sch. of Comput. Sci., Adelaide Univ., SA, Australia
fYear
2004
fDate
26-30 April 2004
Firstpage
249
Abstract
Summary form only given. We give an overview of two related tools that we have developed to provide more accurate measurement and modelling of the performance of message passing programs and communications on distributed memory parallel computers. MPIBench uses a very precise, globally synchronised clock to measure the performance of MPI communication routines, and can generate probability distributions of communication times, not just the average values produced by other MPI benchmarks. This allows useful insights into MPI communications performance of parallel computers, particularly the effects of network contention. PEVPM provides a simple, fast and accurate technique for performance modelling and prediction of message-passing parallel programs. It uses a virtual parallel machine to simulate the execution of the parallel program. The effects of network contention can be accurately modelled by sampling from the probability distributions generated by MPIBench. These tools are particularly useful on Beowulf clusters with commodity Ethernet networks, where relatively high latencies, network congestion and TCP problems can significantly affect communication performance, and can be difficult to model accurately using other tools. Experiments with example parallel programs demonstrate that PEVPM gives accurate performance predictions on Beowulf clusters. We also show that modelling communication performance using average times rather than sampling from probability distributions can give misleading results, particularly for a large number of processors.
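The abstract's closing claim, that modelling with average communication times can mislead as the processor count grows, can be illustrated with a small sketch. This is not MPIBench or PEVPM code. It is a toy model under stated assumptions: each process sends one message per synchronised step, the step ends when the slowest message arrives, and message times follow a hypothetical distribution (100 microseconds base latency plus an exponential contention tail with a 50 microsecond mean). All function names and constants are illustrative.

```python
import random
import statistics

BASE_LATENCY_US = 100.0      # hypothetical fixed latency
TAIL_MEAN_US = 50.0          # hypothetical mean of the contention tail
MEAN_LATENCY_US = BASE_LATENCY_US + TAIL_MEAN_US  # average message time


def sample_latency(rng):
    """Draw one message time from the assumed skewed distribution."""
    return BASE_LATENCY_US + rng.expovariate(1.0 / TAIL_MEAN_US)


def simulate_step(num_procs, rng):
    """One synchronised step: it completes when the slowest message arrives."""
    return max(sample_latency(rng) for _ in range(num_procs))


def sampled_step_time(num_procs, trials=2000, seed=0):
    """Average step time when each message time is sampled from the distribution."""
    rng = random.Random(seed)
    return statistics.fmean([simulate_step(num_procs, rng) for _ in range(trials)])


def compare(proc_counts=(2, 16, 128)):
    """Map each processor count to (average-based estimate, sampling-based estimate)."""
    return {p: (MEAN_LATENCY_US, sampled_step_time(p)) for p in proc_counts}


if __name__ == "__main__":
    for procs, (avg_model, sampled_model) in compare().items():
        print(f"{procs:4d} procs: average model {avg_model:.1f} us, "
              f"sampled model {sampled_model:.1f} us")
```

In this toy setting the average-based estimate stays constant as processes are added, while the sampling-based estimate grows, because the step time is governed by the slowest of many draws from the tail rather than by the mean.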
Keywords
benchmark testing; distributed memory systems; message passing; parallel machines; parallel programming; performance evaluation; statistical distributions; virtual machines; workstation clusters; Beowulf clusters; MPI program; MPIBench tool; PEVPM tool; cluster computer; communication benchmarking; distributed memory parallel computer; message passing program; performance modeling; probability distribution; virtual parallel machine; Clocks; Computer networks; Concurrent computing; Distributed computing; Message passing; Parallel machines; Predictive models; Probability distribution; Sampling methods; Synchronization;
fLanguage
English
Publisher
IEEE
Conference_Titel
Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
Print_ISBN
0-7695-2132-0
Type
conf
DOI
10.1109/IPDPS.2004.1303309
Filename
1303309