DocumentCode
1765054
Title
Approximate
-Player Nonzero-Sum Game Solution for an Uncertain Continuous Nonlinear System
Author
Johnson, Marcus ; Kamalapurkar, Rushikesh ; Bhasin, Shubhendu ; Dixon, Warren E.
Author_Institution
Dept. of Mech. & Aerosp. Eng., Univ. of Florida, Gainesville, FL, USA
Volume
26
Issue
8
fYear
2015
fDate
Aug. 2015
Firstpage
1645
Lastpage
1658
Abstract
An approximate online equilibrium solution is developed for an N-player nonzero-sum game subject to continuous-time nonlinear unknown dynamics and an infinite horizon quadratic cost. A novel actor-critic-identifier structure is used, wherein a robust dynamic neural network is used to asymptotically identify the uncertain system with additive disturbances, and a set of critic and actor NNs are used to approximate the value functions and equilibrium policies, respectively. The weight update laws for the actor neural networks (NNs) are generated using a gradient-descent method, and the critic NNs are generated by least square regression, which are both based on the modified Bellman error that is independent of the system dynamics. A Lyapunov-based stability analysis shows that uniformly ultimately bounded tracking is achieved, and a convergence analysis demonstrates that the approximate control policies converge to a neighborhood of the optimal solutions. The actor, critic, and identifier structures are implemented in real time continuously and simultaneously. Simulations on two and three player games illustrate the performance of the developed method.
Keywords
Lyapunov methods; continuous systems; continuous time systems; convergence; game theory; gradient methods; infinite horizon; least squares approximations; neurocontrollers; nonlinear dynamical systems; regression analysis; robust control; uncertain systems; Lyapunov-based stability analysis; actor NNs; actor neural networks; actor-critic-identifier structure; additive disturbances; approximate N-player nonzero-sum game solution; approximate control policies; approximate online equilibrium solution; continuous-time nonlinear unknown dynamics; convergence analysis; gradient-descent method; infinite horizon quadratic cost; least square regression; modified Bellman error; robust dynamic neural network; system dynamics; uncertain continuous nonlinear system; value functions; weight update laws; Artificial neural networks; Equations; Games; Nash equilibrium; Nonlinear dynamical systems; Optimal control; Actor–critic (AC) methods; Actor???critic (AC) methods; adaptive control; adaptive dynamic programming; differential games; optimal control;
fLanguage
English
Journal_Title
Neural Networks and Learning Systems, IEEE Transactions on
Publisher
ieee
ISSN
2162-237X
Type
jour
DOI
10.1109/TNNLS.2014.2350835
Filename
6918499
Link To Document