DocumentCode :
3182351
Title :
Online learning algorithm for Stackelberg games in problems with hierarchy
Author :
Vamvoudakis, Kyriakos G. ; Lewis, Frank L. ; Johnson, Mark ; Dixon, Warren E.
Author_Institution :
Center for Control, Dynamical-Syst. & Comput. (CCDC), Univ. of California, Santa Barbara, Santa Barbara, CA, USA
fYear :
2012
fDate :
10-13 Dec. 2012
Firstpage :
1883
Lastpage :
1889
Abstract :
This paper presents an online adaptive optimal control algorithm based on policy iteration reinforcement learning techniques to solve the continuous-time Stackelberg games with infinite horizon for linear systems. This adaptive optimal control method finds in real-time approximations of the optimal value and the Stackelberg-equilibrium solution, while also guaranteeing closed-loop stability. The optimal-adaptive algorithm is implemented as a separate actor/critic parametric network approximator structure for every player, and involves simultaneous continuous-time adaptation of the actor/critic networks. Novel tuning algorithms are given for the actor/critic networks. The convergence to the closed-loop Stackelberg equilibrium is proven and stability of the system is also guaranteed. A simulation example shows the effectiveness of the new online algorithm.
Keywords :
adaptive control; game theory; learning (artificial intelligence); optimal control; Stackelberg equilibrium solution; closed loop Stackelberg equilibrium; closed loop stability; continuous time Stackelberg games; continuous time adaptation; linear system; online adaptive optimal control algorithm; online learning algorithm; optimal adaptive algorithm; policy iteration reinforcement learning; tuning algorithm; Approximation algorithms; Artificial neural networks; Equations; Function approximation; Games; Tuning; Stackelberg games; hierarchical control problems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control (CDC), 2012 IEEE 51st Annual Conference on
Conference_Location :
Maui, HI
ISSN :
0743-1546
Print_ISBN :
978-1-4673-2065-8
Electronic_ISBN :
0743-1546
Type :
conf
DOI :
10.1109/CDC.2012.6426969
Filename :
6426969
Link To Document :
بازگشت