Approximate $N$ -Player Nonzero-Sum Game Solution for an Uncertain Continuous Nonlinear System

Author

Johnson, Marcus ; Kamalapurkar, Rushikesh ; Bhasin, Shubhendu ; Dixon, Warren E.

Author_Institution

Dept. of Mech. & Aerosp. Eng., Univ. of Florida, Gainesville, FL, USA

Volume

26

Issue

8

fYear

2015

fDate

Aug. 2015

Firstpage

1645

Lastpage

1658

Abstract

An approximate online equilibrium solution is developed for an N-player nonzero-sum game subject to continuous-time nonlinear unknown dynamics and an infinite horizon quadratic cost. A novel actor-critic-identifier structure is used, wherein a robust dynamic neural network is used to asymptotically identify the uncertain system with additive disturbances, and a set of critic and actor NNs are used to approximate the value functions and equilibrium policies, respectively. The weight update laws for the actor neural networks (NNs) are generated using a gradient-descent method, and the critic NNs are generated by least square regression, which are both based on the modified Bellman error that is independent of the system dynamics. A Lyapunov-based stability analysis shows that uniformly ultimately bounded tracking is achieved, and a convergence analysis demonstrates that the approximate control policies converge to a neighborhood of the optimal solutions. The actor, critic, and identifier structures are implemented in real time continuously and simultaneously. Simulations on two and three player games illustrate the performance of the developed method.

Keywords

Lyapunov methods; continuous systems; continuous time systems; convergence; game theory; gradient methods; infinite horizon; least squares approximations; neurocontrollers; nonlinear dynamical systems; regression analysis; robust control; uncertain systems; Lyapunov-based stability analysis; actor NNs; actor neural networks; actor-critic-identifier structure; additive disturbances; approximate N-player nonzero-sum game solution; approximate control policies; approximate online equilibrium solution; continuous-time nonlinear unknown dynamics; convergence analysis; gradient-descent method; infinite horizon quadratic cost; least square regression; modified Bellman error; robust dynamic neural network; system dynamics; uncertain continuous nonlinear system; value functions; weight update laws; Artificial neural networks; Equations; Games; Nash equilibrium; Nonlinear dynamical systems; Optimal control; Actor–critic (AC) methods; Actor???critic (AC) methods; adaptive control; adaptive dynamic programming; differential games; optimal control;

fLanguage

English

Journal_Title

Neural Networks and Learning Systems, IEEE Transactions on

Publisher

ieee

ISSN

2162-237X

Type

jour

DOI

10.1109/TNNLS.2014.2350835

Filename

6918499

Approximate -Player Nonzero-Sum Game Solution for an Uncertain Continuous Nonlinear System

Johnson, Marcus ; Kamalapurkar, Rushikesh ; Bhasin, Shubhendu ; Dixon, Warren E.

jour

Approximate $N$ -Player Nonzero-Sum Game Solution for an Uncertain Continuous Nonlinear System