مرکز منطقه ای اطلاع رساني علوم و فناوري - Non-zero sum games: Online learning solution of coupled Hamilton-Jacobi and coupled Riccati equations

DocumentCode :

1819142

Title :

Non-zero sum games: Online learning solution of coupled Hamilton-Jacobi and coupled Riccati equations

Author :

Vamvoudakis, Kyriakos G. ; Lewis, Frank L.

Author_Institution :

Autom. & Robot. Res. Inst., Univ. of Texas at Arlington, Fort Worth, TX, USA

fYear :

2011

fDate :

28-30 Sept. 2011

Firstpage :

171

Lastpage :

178

Abstract :

In this paper we present an online adaptive control algorithm based on policy iteration reinforcement learning techniques to solve the continuous-time (CT) multi player non zero sum (NZS) game with infinite horizon for linear and nonlinear systems. The adaptive algorithm learns online the solution of coupled Riccati equations and coupled Hamilton-Jacobi equations for linear and nonlinear systems respectively. This adaptive control method finds in real-time approximations of the optimal value and the NZS Nash-equilibrium, while also guaranteeing closed-loop stability. The optimal-adaptive algorithm is implemented as a separate actor/critic parametric network approximator structure for every player, and involves simultaneous continuous-time adaptation of the actor/critic networks. A persistence of excitation condition is shown to guarantee convergence of every critic to the actual optimal value function for that player. A detailed mathematical analysis is done for 2-player NZS games. Novel tuning algorithms are given for the actor/critic networks. The convergence to the ash equilibrium is proven and stability of the system is also guaranteed. Simulation examples show the effectiveness of the new algorithm.

Keywords :

Riccati equations; adaptive control; approximation theory; closed loop systems; continuous time systems; game theory; learning (artificial intelligence); linear systems; nonlinear control systems; stability; Hamilton-Jacobi equation; Nash equilibrium; Riccati equation; actor-critic parametric network approximator structure; closed-loop stability; continuous-time sum game; linear system; nonlinear system; nonzero sum game; online adaptive control algorithm; policy iteration reinforcement learning technique; realtime approximation; tuning algorithm; Adaptive control; Approximation algorithms; Approximation methods; Artificial neural networks; Equations; Games; Mathematical model; Coupled Hamilton-Jacobi equations; Coupled Riccati equations; Multi-Player games; adaptive optimal control; ash equilibrium;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Control (ISIC), 2011 IEEE International Symposium on

Conference_Location :

Denver, CO

ISSN :

2158-9860

Print_ISBN :

978-1-4577-1104-6

Electronic_ISBN :

2158-9860

Type :

conf

DOI :

10.1109/ISIC.2011.6045401

Filename :

6045401

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1819142