DocumentCode :
3116915
Title :
DFG implementation on multi GPU cluster with computation-communication overlap
Author :
Huet, Sylvain ; Boulos, Vincent ; Fristot, Vincent ; Salvo, Luc
Author_Institution :
GIPSA-Lab., UJF, Grenoble, France
fYear :
2011
fDate :
2-4 Nov. 2011
Firstpage :
1
Lastpage :
8
Abstract :
Nowadays, it is possible to build a multi-GPU supercomputer, well suited to implementing digital signal processing algorithms, for a few thousand dollars. However, to achieve the highest performance with this kind of architecture, the programmer has to focus on inter-processor communication, task synchronization, and so on. In this paper, we propose a design flow allowing an efficient implementation of a Digital Signal Processing (DSP) application specified as a Data Flow Graph (DFG) on a multi-GPU computer cluster. We focus particularly on the effective implementation of communications by automating the computation-communication overlap, which can lead to significant speedups, as shown in the presented benchmark. The approach is validated on a 3D granulometry application developed for research on materials.
Keywords :
coprocessors; data flow graphs; digital signal processing chips; 3D granulometry application; computation-communication overlap automation; data flow graph; design flow; digital signal processing algorithm; inter-processor communication; multiGPU computer cluster; multiGPU supercomputer; tasks synchronization; Computer architecture; Computers; Digital signal processing; Graphics processing unit; Instruction sets; Kernel; Synchronization;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Design and Architectures for Signal and Image Processing (DASIP), 2011 Conference on
Conference_Location :
Tampere
Print_ISBN :
978-1-4577-0620-2
Electronic_ISBN :
978-1-4577-0619-6
Type :
conf
DOI :
10.1109/DASIP.2011.6136859
Filename :
6136859