DocumentCode
48897
Title
A GPU-Accelerated Parallel Shooting Algorithm for Analysis of Radio Frequency and Microwave Integrated Circuits
Author
Xue-Xin Liu ; Hao Yu ; Tan, Sheldon X.-D
Author_Institution
Synopysis Corp., Mountain View, CA, USA
Volume
23
Issue
3
fYear
2015
fDate
Mar-15
Firstpage
480
Lastpage
492
Abstract
This paper presents a new parallel shooting-Newton method based on a graphic processing unit (GPU)-accelerated periodic Arnoldi shooting solver (GAPAS) for fast periodic steady-state analysis of radio frequency/millimeter-wave integrated circuits. The new algorithm first explores a periodic structure of the state matrix by using a periodic Arnoldi algorithm for computing the resulting structured Krylov subspace in the generalized minimal residual (GMRES) solver. The resulting periodic Arnoldi shooting method is very amenable for massive parallel computing, such as GPUs. Second, the periodic Arnoldi-based GMRES solver in the shooting-Newton method is parallelized on the recent NVIDIA Tesla GPU platforms. We further explore CUDA GPUs features, such as coalesced memory access and overlapping transfers with computation to boost the efficiency of the resulting parallel GAPAS method. Experimental results from several industrial examples show that when compared with the state-of-the-art implicit GMRES method under the same accuracy, the new parallel shooting-Newton method can lead up to $8times$ speedup.
Keywords
Newton method; graphics processing units; matrix algebra; microwave integrated circuits; GAPAS; GMRES solver; GPU-accelerated parallel shooting algorithm; NVIDIA Tesla platforms; Newton method; coalesced memory access; fast periodic steady-state analysis; generalized minimal residual solver; graphic processing unit; microwave integrated circuits; parallel computing; periodic Arnoldi shooting solver; periodic structure; radiofrequency integrated circuits; state matrix; structured Krylov subspace; Algorithm design and analysis; Equations; Graphics processing units; Instruction sets; Jacobian matrices; Mathematical model; Periodic structures; Arnoldi iteration; generalized minimal residual (GMRES); graphic processing unit (GPU) parallelization; periodic steady-state (PSS) analysis; shooting-Newton method; structured Krylov-subspace; structured Krylov-subspace.;
fLanguage
English
Journal_Title
Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
Publisher
ieee
ISSN
1063-8210
Type
jour
DOI
10.1109/TVLSI.2014.2309606
Filename
6777551
Link To Document