Title :
Online learning algorithm for Stackelberg games in problems with hierarchy
Author :
Vamvoudakis, Kyriakos G. ; Lewis, Frank L. ; Johnson, Mark ; Dixon, Warren E.
Author_Institution :
Center for Control, Dynamical-Syst. & Comput. (CCDC), Univ. of California, Santa Barbara, Santa Barbara, CA, USA
Abstract :
This paper presents an online adaptive optimal control algorithm based on policy iteration reinforcement learning techniques to solve the continuous-time Stackelberg games with infinite horizon for linear systems. This adaptive optimal control method finds in real-time approximations of the optimal value and the Stackelberg-equilibrium solution, while also guaranteeing closed-loop stability. The optimal-adaptive algorithm is implemented as a separate actor/critic parametric network approximator structure for every player, and involves simultaneous continuous-time adaptation of the actor/critic networks. Novel tuning algorithms are given for the actor/critic networks. The convergence to the closed-loop Stackelberg equilibrium is proven and stability of the system is also guaranteed. A simulation example shows the effectiveness of the new online algorithm.
Keywords :
adaptive control; game theory; learning (artificial intelligence); optimal control; Stackelberg equilibrium solution; closed loop Stackelberg equilibrium; closed loop stability; continuous time Stackelberg games; continuous time adaptation; linear system; online adaptive optimal control algorithm; online learning algorithm; optimal adaptive algorithm; policy iteration reinforcement learning; tuning algorithm; Approximation algorithms; Artificial neural networks; Equations; Function approximation; Games; Tuning; Stackelberg games; hierarchical control problems;
Conference_Titel :
Decision and Control (CDC), 2012 IEEE 51st Annual Conference on
Conference_Location :
Maui, HI
Print_ISBN :
978-1-4673-2065-8
Electronic_ISBN :
0743-1546
DOI :
10.1109/CDC.2012.6426969