DocumentCode
426859
Title
Performance Analysis and Optimization on the UCLA Parallel Atmospheric General Circulation Model Code
Author
Lou, John Z. ; Farrara, John D.
Author_Institution
California Institute of Technology, Pasadena, CA
fYear
1996
fDate
1996
Firstpage
14
Lastpage
14
Abstract
An analysis is presented of the primary factors influencing the performance of a parallel implementation of the UCLA atmospheric general circulation model (AGCM) on distributed-memory, massively parallel computer systems. Several modifications to the original parallel AGCM code aimed at improving its numerical efficiency, load-balance and single-node code performance are discussed. The impact of these optimization strategies on the performance on two of the state-of-the-art parallel computers, the Intel Paragon and Cray T3D, is presented and analyzed. It is found that implementation of a load-balanced FFT algorithm results in a reduction in overall execution time of approximately 45% compared to the original convolution-based algorithm. Preliminary results of the application of a load-balancing scheme for the Physics part of the AGCM code suggest additional reductions in execution time of 10-15% can be achieved. Finally, several strategies for improving the single-node performance of the code are presented, and the results obtained thus far suggest reductions in execution time in the range of 25-35% are possible.
Keywords
Atmospheric modeling; Computational modeling; Concurrent computing; Distributed computing; Drives; Laboratories; Paper technology; Performance analysis; Propulsion; Scalability;
fLanguage
English
Publisher
ieee
Conference_Titel
Supercomputing, 1996. Proceedings of the 1996 ACM/IEEE Conference on
Print_ISBN
0-89791-854-1
Type
conf
DOI
10.1109/SUPERC.1996.183520
Filename
1392889
Link To Document