• DocumentCode
    48897
  • Title

    A GPU-Accelerated Parallel Shooting Algorithm for Analysis of Radio Frequency and Microwave Integrated Circuits

  • Author

    Xue-Xin Liu; Hao Yu; Sheldon X.-D. Tan

  • Author_Institution
    Synopsys Corp., Mountain View, CA, USA
  • Volume
    23
  • Issue
    3
  • fYear
    2015
  • fDate
    Mar-15
  • Firstpage
    480
  • Lastpage
    492
  • Abstract
    This paper presents a new parallel shooting-Newton method based on a graphic processing unit (GPU)-accelerated periodic Arnoldi shooting solver (GAPAS) for fast periodic steady-state analysis of radio frequency/millimeter-wave integrated circuits. The new algorithm first explores the periodic structure of the state matrix by using a periodic Arnoldi algorithm to compute the resulting structured Krylov subspace in the generalized minimal residual (GMRES) solver. The resulting periodic Arnoldi shooting method is well suited to massively parallel computing platforms such as GPUs. Second, the periodic Arnoldi-based GMRES solver in the shooting-Newton method is parallelized on recent NVIDIA Tesla GPU platforms. We further explore CUDA GPU features, such as coalesced memory access and overlapping transfers with computation, to boost the efficiency of the resulting parallel GAPAS method. Experimental results from several industrial examples show that, compared with the state-of-the-art implicit GMRES method at the same accuracy, the new parallel shooting-Newton method can achieve up to an $8\times$ speedup.
  • Keywords
    Newton method; graphics processing units; matrix algebra; microwave integrated circuits; GAPAS; GMRES solver; GPU-accelerated parallel shooting algorithm; NVIDIA Tesla platforms; coalesced memory access; fast periodic steady-state analysis; generalized minimal residual solver; graphic processing unit; parallel computing; periodic Arnoldi shooting solver; periodic structure; radiofrequency integrated circuits; state matrix; structured Krylov subspace; Algorithm design and analysis; Equations; Graphics processing units; Instruction sets; Jacobian matrices; Mathematical model; Periodic structures; Arnoldi iteration; generalized minimal residual (GMRES); graphic processing unit (GPU) parallelization; periodic steady-state (PSS) analysis; shooting-Newton method; structured Krylov-subspace
  • fLanguage
    English
  • Journal_Title
    Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1063-8210
  • Type

    jour

  • DOI
    10.1109/TVLSI.2014.2309606
  • Filename
    6777551