Title :
CUDA parallel programming model
Author_Institution :
NVIDIA Research, USA
Abstract :
Presents a collection of slides covering the following topics: parallel threads; parallel algorithms; heterogeneous systems; CPU; GPU; concurrent threads; shared memory model; vector addition kernel; block synchronization; thread block; per-block shared memory; parallel reduction; serial SAXPY routine; and parallel SAXPY routine.
Keywords :
"Instruction sets","Graphics processing units","Tutorials","Parallel programming","Kernel","Parallel algorithms","Synchronization"
Conference_Titel :
Hot Chips 20 Symposium (HCS), 2008 IEEE
DOI :
10.1109/HOTCHIPS.2008.7476519