Title :
Automatic Trace-Based Performance Analysis of Metacomputing Applications
Author :
Becker, Daniel ; Wolf, Felix ; Frings, Wolfgang ; Geimer, Markus ; Wylie, Brian J N ; Mohr, Bernd
Author_Institution :
John von Neumann Inst. for Comput., Forschungszentrum Julich
Abstract :
The processing power and memory capacity of independent and heterogeneous parallel machines can be combined to form a single parallel system that is more powerful than any of its constituents. However, achieving satisfactory application performance on such a metacomputer is hard because the high latency of inter-machine communication as well as differences in hardware of constituent machines may introduce various types of wait states. In our earlier work, we have demonstrated that automatic pattern search in event traces can identify the sources of wait states in parallel applications running on a single computer. In this article, we describe how this approach can be extended to metacomputing environments with special emphasis on performance problems related to inter-machine communication. In addition, we demonstrate the benefits of our solution using a real-world multi-physics application.
Keywords :
metacomputing; parallel machines; event trace; inter-machine communication; metacomputing; parallel machine; Application software; Computer science; Concurrent computing; Delay; Grid computing; Hardware; Metacomputing; Parallel machines; Performance analysis; Weather forecasting; event tracing; grid computing; metacomputing; performance tools;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
Conference_Location :
Long Beach, CA
Print_ISBN :
1-4244-0910-1
Electronic_ISBN :
1-4244-0910-1
DOI :
10.1109/IPDPS.2007.370238