• DocumentCode
    1765054
  • Title

    Approximate N -Player Nonzero-Sum Game Solution for an Uncertain Continuous Nonlinear System

  • Author

    Johnson, Marcus ; Kamalapurkar, Rushikesh ; Bhasin, Shubhendu ; Dixon, Warren E.

  • Author_Institution
    Dept. of Mech. & Aerosp. Eng., Univ. of Florida, Gainesville, FL, USA
  • Volume
    26
  • Issue
    8
  • fYear
    2015
  • fDate
    Aug. 2015
  • Firstpage
    1645
  • Lastpage
    1658
  • Abstract
    An approximate online equilibrium solution is developed for an N-player nonzero-sum game subject to continuous-time nonlinear unknown dynamics and an infinite horizon quadratic cost. A novel actor-critic-identifier structure is used, wherein a robust dynamic neural network is used to asymptotically identify the uncertain system with additive disturbances, and a set of critic and actor NNs are used to approximate the value functions and equilibrium policies, respectively. The weight update laws for the actor neural networks (NNs) are generated using a gradient-descent method, and the critic NNs are generated by least square regression, which are both based on the modified Bellman error that is independent of the system dynamics. A Lyapunov-based stability analysis shows that uniformly ultimately bounded tracking is achieved, and a convergence analysis demonstrates that the approximate control policies converge to a neighborhood of the optimal solutions. The actor, critic, and identifier structures are implemented in real time continuously and simultaneously. Simulations on two and three player games illustrate the performance of the developed method.
  • Keywords
    Lyapunov methods; continuous systems; continuous time systems; convergence; game theory; gradient methods; infinite horizon; least squares approximations; neurocontrollers; nonlinear dynamical systems; regression analysis; robust control; uncertain systems; Lyapunov-based stability analysis; actor NNs; actor neural networks; actor-critic-identifier structure; additive disturbances; approximate N-player nonzero-sum game solution; approximate control policies; approximate online equilibrium solution; continuous-time nonlinear unknown dynamics; convergence analysis; gradient-descent method; infinite horizon quadratic cost; least square regression; modified Bellman error; robust dynamic neural network; system dynamics; uncertain continuous nonlinear system; value functions; weight update laws; Artificial neural networks; Equations; Games; Nash equilibrium; Nonlinear dynamical systems; Optimal control; Actor–critic (AC) methods; Actor???critic (AC) methods; adaptive control; adaptive dynamic programming; differential games; optimal control;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2014.2350835
  • Filename
    6918499