• DocumentCode
    3349150
  • Title

    A switched dynamical system framework for analysis of massively parallel asynchronous numerical algorithms

  • Author

    Kooktae Lee ; Bhattacharya, Raktim ; Gupta, Vijay

  • Author_Institution
    Dept. of Aerosp. Eng., Texas A&M Univ., College Station, TX, USA
  • fYear
    2015
  • fDate
    1-3 July 2015
  • Firstpage
    1095
  • Lastpage
    1100
  • Abstract
    In the near future, massively parallel computing systems will be necessary to solve computation intensive applications. The key bottleneck in massively parallel implementation of numerical algorithms is the synchronization of data across processing elements (PEs) after each iteration, which results in significant idle time. Thus, there is a trend towards relaxing the synchronization and adopting an asynchronous model of computation to reduce idle time. However, it is not clear what is the effect of this relaxation on the stability and accuracy of the numerical algorithm. In this paper we present a new framework to analyze such algorithms. We treat the computation in each PE as a dynamical system and model the asynchrony as stochastic switching. The overall system is then analyzed as a switched dynamical system. However, modeling of massively parallel numerical algorithms as switched dynamical systems results in a very large number of modes, which makes current analysis tools available for such systems computationally intractable. We develop new techniques that circumvent this scalability issue. The framework is presented on a one-dimensional heat equation as a case study for the partial differential equation (PDE), and the proposed analysis tools are verified by implementing asynchronous communications between cores on an nVIDIA Tesla™ GPU machine.
  • Keywords
    graphics processing units; parallel processing; partial differential equations; PDE; PE; asynchronous communications; computation intensive applications; massively parallel asynchronous numerical algorithms; massively parallel computing systems; massively parallel implementation; massively parallel numerical algorithms; nVIDIA Tesla GPU machine; one-dimensional heat equation; partial differential equation; processing elements; stability; stochastic switching; switched dynamical system framework; Convergence; Eigenvalues and eigenfunctions; Heuristic algorithms; Linear matrix inequalities; Lyapunov methods; Steady-state; Switches;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    American Control Conference (ACC), 2015
  • Conference_Location
    Chicago, IL
  • Print_ISBN
    978-1-4799-8685-9
  • Type

    conf

  • DOI
    10.1109/ACC.2015.7170879
  • Filename
    7170879