DocumentCode :
2794025
Title :
Performance characterization of the NAS Parallel Benchmarks in OpenCL
Author :
Seo, Sangmin ; Jo, Gangwon ; Lee, Jaejin
Author_Institution :
Center for Manycore Program., Seoul Nat. Univ., Seoul, South Korea
fYear :
2011
fDate :
6-8 Nov. 2011
Firstpage :
137
Lastpage :
148
Abstract :
Heterogeneous parallel computing platforms, which are composed of different processors (e.g., CPUs, GPUs, FPGAs, and DSPs), are widening their user base in all computing domains. With this trend, parallel programming models need to achieve portability across different processors as well as high performance with reasonable programming effort. OpenCL (Open Computing Language) is an open standard and emerging parallel programming model to write parallel applications for such heterogeneous platforms. In this paper, we characterize the performance of an OpenCL implementation of the NAS Parallel Benchmark suite (NPB) on a heterogeneous parallel platform that consists of general-purpose CPUs and a GPU. We believe that understanding the performance characteristics of conventional workloads, such as the NPB, with an emerging programming model (i.e., OpenCL) is important for developers and researchers to adopt the programming model. We also compare the performance of the NPB in OpenCL to that of the OpenMP version. We describe the process of implementing the NPB in OpenCL and optimizations applied in our implementation. Experimental results and analysis show that the OpenCL version has different characteristics from the OpenMP version on multicore CPUs and exhibits different performance characteristics depending on different OpenCL compute devices. The results also indicate that the application needs to be rewritten or re-optimized for better performance on a different compute device although OpenCL provides source-code portability.
Keywords :
graphics processing units; high level languages; microprocessor chips; parallel programming; CPU; DSP; FPGA; GPU; NAS Parallel Benchmark suite; NAS parallel benchmarks; NPB; Open Computing Language; OpenCL; heterogeneous parallel computing platforms; heterogeneous parallel platform; heterogeneous platforms; open standard; parallel applications; parallel programming models; source-code portability; Computational modeling; Graphics processing unit; Indexes; Kernel; Multicore processing; Optimization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Workload Characterization (IISWC), 2011 IEEE International Symposium on
Conference_Location :
Austin, TX
Print_ISBN :
978-1-4577-2063-5
Electronic_ISBN :
978-1-4577-2062-8
Type :
conf
DOI :
10.1109/IISWC.2011.6114174
Filename :
6114174
Link To Document :
بازگشت