DocumentCode :
145407
Title :
CUDA Memory Techniques for Matrix Multiplication on Quadro 4000
Author :
Athil, Tekesha ; Christian, Rehtanz ; Reddy, Yenumula B.
fYear :
2014
fDate :
7-9 April 2014
Firstpage :
419
Lastpage :
425
Abstract :
Today, the industry old adage of sequential processing is certainly no longer sufficient. The need for high performance computation is ever growing, even though certain problem sets remain within the realm of super high performance computing with applications such as weather forecasting, quantum physics and climate research to name a few. Within the commercial realm of computation, NVIDIA has proposed an architectural framework (NVIDIA CUDA) to harness the power of GPUs which before was only been utilized for Graphics Application like 3D games, but now recently, been used for certain types of high performance computation. In this paper, we will take a critical look at different performance techniques such as tiling, memory coalescing, perfecting, and loop unrolling, in trying to evaluate which method is the most efficient approach for our problem set (matrix operation of n size matrices).
Keywords :
mathematics computing; matrix multiplication; parallel architectures; parallel processing; shared memory systems; 3D games; CUDA memory techniques; GPUs; NVIDIA CUDA; Quadro 4000; architectural framework; climate research; graphics application; matrix multiplication; memory coalescing; memory loop unrolling; memory perfecting; memory tiling; quantum physics; sequential processing; shared memory system; super high performance computing; weather forecasting; Central Processing Unit; Graphics processing units; Instruction sets; Kernel; Registers; Testing; Tiles; GPU; loop unrolling; memory coalescing; perfecting; performance; tiling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology: New Generations (ITNG), 2014 11th International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4799-3187-3
Type :
conf
DOI :
10.1109/ITNG.2014.24
Filename :
6822233
Link To Document :
بازگشت