DocumentCode :
3750623
Title :
An implementation of the ACA on GPU platform
Author :
Xing Mu;Hou-Xing Zhou;Zhe Song;Wei-Bing Kong;Wei Hong
Author_Institution :
State Key Lab. of Millimeter Waves, Southeast University of China, Nanjing 210096
Volume :
2
fYear :
2015
Firstpage :
1
Lastpage :
3
Abstract :
This paper presents an implementation of the adaptive cross approximation (ACA) on a GPU platform. The implementation has two parts: matrix compression using the ACA, and batched matrix-vector products in H-matrix form. Numerical examples demonstrate the overall performance of the GPU implementation by comparison with a 4-threaded CPU implementation. In these test examples, the speedup for ACA matrix compression is about 50 to 100 in single precision and 25 to 50 in double precision, while the speedup for the batched matrix-vector products in H-matrix form is about 10 to 30 in single precision and 6 to 17 in double precision.
Keywords :
"Graphics processing units","Method of moments","Kernel","Instruction sets","Scattering","Antennas","MLFMA"
Publisher :
IEEE
Conference_Title :
Microwave Conference (APMC), 2015 Asia-Pacific
Print_ISBN :
978-1-4799-8765-8
Type :
conf
DOI :
10.1109/APMC.2015.7413109
Filename :
7413109