مرکز منطقه ای اطلاع رساني علوم و فناوري - An implementation of the ACA on GPU platform

DocumentCode :

3750623

Title :

An implementation of the ACA on GPU platform

Author :

Xing Mu;Hou-Xing Zhou;Zhe Song;Wei-Bing Kong;Wei Hong

Author_Institution :

State Key Lab. of Millimeter Waves, Southeast University of China, Nanjing 210096

Volume :

fYear :

2015

Firstpage :

Lastpage :

Abstract :

In this paper, an implementation of the ACA on GPU platform is presented, involving two parts: the matrix compression using the ACA and the batched matrix-vector products utilizing H-matrix form. Some numerical examples are provided to demonstrate the overall performance of the proposed implementation of the ACA algorithm on GPU platform through comparison with the 4-threaded CPU algorithm. In these testing examples, the speedup ratio of performing matrix compression using the ACA can achieve about from 50 to 100 for the single-precision case and from 25 to 50 for the double-precision case, and that the speedup ratio of executing batched batched matrix-vector products utilizing H-matrix form can achieve about from 10 to 30 for the single-precision case and from 6 to 17 for the double-precision case.

Keywords :

"Graphics processing units","Method of moments","Kernel","Instruction sets","Scattering","Antennas","MLFMA"

Publisher :

ieee

Conference_Titel :

Microwave Conference (APMC), 2015 Asia-Pacific

Print_ISBN :

978-1-4799-8765-8

Type :

conf

DOI :

10.1109/APMC.2015.7413109

Filename :

7413109

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3750623