DocumentCode :
1819142
Title :
Non-zero sum games: Online learning solution of coupled Hamilton-Jacobi and coupled Riccati equations
Author :
Vamvoudakis, Kyriakos G. ; Lewis, Frank L.
Author_Institution :
Autom. & Robot. Res. Inst., Univ. of Texas at Arlington, Fort Worth, TX, USA
fYear :
2011
fDate :
28-30 Sept. 2011
Firstpage :
171
Lastpage :
178
Abstract :
In this paper we present an online adaptive control algorithm based on policy iteration reinforcement learning techniques to solve the continuous-time (CT) multi player non zero sum (NZS) game with infinite horizon for linear and nonlinear systems. The adaptive algorithm learns online the solution of coupled Riccati equations and coupled Hamilton-Jacobi equations for linear and nonlinear systems respectively. This adaptive control method finds in real-time approximations of the optimal value and the NZS Nash-equilibrium, while also guaranteeing closed-loop stability. The optimal-adaptive algorithm is implemented as a separate actor/critic parametric network approximator structure for every player, and involves simultaneous continuous-time adaptation of the actor/critic networks. A persistence of excitation condition is shown to guarantee convergence of every critic to the actual optimal value function for that player. A detailed mathematical analysis is done for 2-player NZS games. Novel tuning algorithms are given for the actor/critic networks. The convergence to the ash equilibrium is proven and stability of the system is also guaranteed. Simulation examples show the effectiveness of the new algorithm.
Keywords :
Riccati equations; adaptive control; approximation theory; closed loop systems; continuous time systems; game theory; learning (artificial intelligence); linear systems; nonlinear control systems; stability; Hamilton-Jacobi equation; Nash equilibrium; Riccati equation; actor-critic parametric network approximator structure; closed-loop stability; continuous-time sum game; linear system; nonlinear system; nonzero sum game; online adaptive control algorithm; policy iteration reinforcement learning technique; realtime approximation; tuning algorithm; Adaptive control; Approximation algorithms; Approximation methods; Artificial neural networks; Equations; Games; Mathematical model; Coupled Hamilton-Jacobi equations; Coupled Riccati equations; Multi-Player games; adaptive optimal control; ash equilibrium;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Control (ISIC), 2011 IEEE International Symposium on
Conference_Location :
Denver, CO
ISSN :
2158-9860
Print_ISBN :
978-1-4577-1104-6
Electronic_ISBN :
2158-9860
Type :
conf
DOI :
10.1109/ISIC.2011.6045401
Filename :
6045401
Link To Document :
بازگشت