DocumentCode
424516
Title
The Synergetic Effect of Compiler, Architecture, and Manual Optimizations on the Performance of CFD on Multiprocessors
Author
Kuba, Masayuki ; Polychronopoulos, Constantine D. ; Gallivan, Kyle
Author_Institution
University of Illinois at Urbana-Champaign
fYear
1995
fDate
1995
Firstpage
72
Lastpage
72
Abstract
This paper discusses the comprehensive performance profiling, improvement, and benchmarking of a Computational Fluid Dynamics code, one of the Grand Challenge applications, on three popular multiprocessors. In the process of analyzing performance, we considered language, compiler, architecture, and algorithmic changes and quantified each of them and their incremental contribution to bottom-line performance. We demonstrate that parallelization alone cannot result in significant gains if the granularity of parallel threads and the effect of parallelization on data locality are not taken into account. Unlike benchmarking studies that often focus on the performance or effectiveness of parallelizing compilers on specific loop kernels, we used the entire CFD code to measure the global effectiveness of compilers and parallel architectures. We probed the performance bottlenecks in each case and derived solutions which eliminate or neutralize the performance-inhibiting factors. The major conclusion of our work is that overall performance is extremely sensitive to the synergetic effects of compiler optimizations, algorithmic and code tuning, and architectural idiosyncrasies.
Keywords
CFD (computational fluid dynamics); cache optimization; parallel algorithms and programs; parallel architectures; parallel processing; parallelizing compilers; performance evaluation; profiling and program tuning; program optimization; Algorithm design and analysis; Computational fluid dynamics; Optimizing compilers; Parallel algorithms; Parallel architectures; Parallel processing; Performance analysis; Permission; Program processors; Research and development
fLanguage
English
Publisher
IEEE
Conference_Titel
Supercomputing, 1995. Proceedings of the IEEE/ACM SC95 Conference
Print_ISBN
0-89791-816-9
Type
conf
DOI
10.1109/SUPERC.1995.242067
Filename
1383209