DocumentCode :
48897
Title :
A GPU-Accelerated Parallel Shooting Algorithm for Analysis of Radio Frequency and Microwave Integrated Circuits
Author :
Xue-Xin Liu ; Hao Yu ; Tan, Sheldon X.-D
Author_Institution :
Synopysis Corp., Mountain View, CA, USA
Volume :
23
Issue :
3
fYear :
2015
fDate :
Mar-15
Firstpage :
480
Lastpage :
492
Abstract :
This paper presents a new parallel shooting-Newton method based on a graphic processing unit (GPU)-accelerated periodic Arnoldi shooting solver (GAPAS) for fast periodic steady-state analysis of radio frequency/millimeter-wave integrated circuits. The new algorithm first explores a periodic structure of the state matrix by using a periodic Arnoldi algorithm for computing the resulting structured Krylov subspace in the generalized minimal residual (GMRES) solver. The resulting periodic Arnoldi shooting method is very amenable for massive parallel computing, such as GPUs. Second, the periodic Arnoldi-based GMRES solver in the shooting-Newton method is parallelized on the recent NVIDIA Tesla GPU platforms. We further explore CUDA GPUs features, such as coalesced memory access and overlapping transfers with computation to boost the efficiency of the resulting parallel GAPAS method. Experimental results from several industrial examples show that when compared with the state-of-the-art implicit GMRES method under the same accuracy, the new parallel shooting-Newton method can lead up to $8times$ speedup.
Keywords :
Newton method; graphics processing units; matrix algebra; microwave integrated circuits; GAPAS; GMRES solver; GPU-accelerated parallel shooting algorithm; NVIDIA Tesla platforms; Newton method; coalesced memory access; fast periodic steady-state analysis; generalized minimal residual solver; graphic processing unit; microwave integrated circuits; parallel computing; periodic Arnoldi shooting solver; periodic structure; radiofrequency integrated circuits; state matrix; structured Krylov subspace; Algorithm design and analysis; Equations; Graphics processing units; Instruction sets; Jacobian matrices; Mathematical model; Periodic structures; Arnoldi iteration; generalized minimal residual (GMRES); graphic processing unit (GPU) parallelization; periodic steady-state (PSS) analysis; shooting-Newton method; structured Krylov-subspace; structured Krylov-subspace.;
fLanguage :
English
Journal_Title :
Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-8210
Type :
jour
DOI :
10.1109/TVLSI.2014.2309606
Filename :
6777551
Link To Document :
بازگشت