DocumentCode :
3716798
Title :
Design and Verification of Heterogeneous Streaming Parallel Mechanisms on Kepler CUDA
Author :
Kailong Zhang;Shaoli Zhou;Liang Hu;Hang Su;Xiao Wu;Kejian Miao
Author_Institution :
Sch. of Comput. Sci. &
fYear :
2015
Firstpage :
2256
Lastpage :
2262
Abstract :
In many-core parallel computing, how to optimally allocate and schedule computing-core resources according to the characteristics of parallel applications is a typical and fundamental problem that closely affects computing performance. Based on an analysis of the features and mechanisms of the Kepler CUDA architecture, three heterogeneous streaming parallel computing modes, together with their corresponding constraints and mechanisms, are studied and described in detail. Considering the performance differences between the processing steps of a single parallel task, a novel mechanism for balancing the resources and performance of the whole task is further studied. Finally, we present typical implementation methods on a Kepler CUDA processor and implement typical matrix-processing algorithms and complicated target-detection algorithms with these three computing modes. Experiments show that these modes can adapt to different types of applications, and that the pipelining parallel computing mode usually performs better.
Keywords :
"Graphics processing units","Synchronization","Parallel processing","Instruction sets","Computer architecture","Kernel","Algorithm design and analysis"
Publisher :
IEEE
Conference_Title :
Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing (CIT/IUCC/DASC/PICOM), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/CIT/IUCC/DASC/PICOM.2015.333
Filename :
7363379