DocumentCode
2567398
Title
Learning in mean-field oscillator games
Author
Yin, Huibing ; Mehta, Prashant G. ; Meyn, Sean P. ; Shanbhag, Uday V.
Author_Institution
Dept. of Mech. Sci. & Eng., Univ. of Illinois at Urbana-Champaign (UIUC), Urbana, IL, USA
fYear
2010
fDate
15-17 Dec. 2010
Firstpage
3125
Lastpage
3132
Abstract
This research concerns a noncooperative dynamic game with large number of oscillators. The states are interpreted as the phase angles for a collection of non-homogeneous oscillators, and in this way the model may be regarded as an extension of the classical coupled oscillator model of Kuramoto. We introduce approximate dynamic programming (ADP) techniques for learning approximating optimal control laws for this model. Two types of parameterizations are considered, each of which is based on analysis of the deterministic PDE model introduced in our prior research. In an offline setting, a Galerkin procedure is introduced to choose the optimal parameters. In an online setting, a steepest descent stochastic approximation algorithm is proposed. We provide detailed analysis of the optimal parameter values as well as the Bellman error with both the Galerkin approximation and the online algorithm. Finally, a phase transition result is described for the large population limit when each oscillator uses the approximately optimal control law. A critical value of the control cost parameter is identified: Above this value, the oscillators are incoherent; and below this value (when control is sufficiently cheap) the oscillators synchronize. These conclusions are illustrated with results from numerical experiments.
Keywords
approximation theory; dynamic programming; game theory; learning (artificial intelligence); optimal control; oscillators; stochastic processes; Galerkin approximation; approximate dynamic programming; classical coupled oscillator model; deterministic PDE model; learning; mean field oscillator games; noncooperative dynamic game; optimal control laws; steepest descent stochastic approximation algorithm; Approximation algorithms; Approximation methods; Games; Mathematical model; Moment methods; Optimal control; Oscillators;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control (CDC), 2010 49th IEEE Conference on
Conference_Location
Atlanta, GA
ISSN
0743-1546
Print_ISBN
978-1-4244-7745-6
Type
conf
DOI
10.1109/CDC.2010.5717142
Filename
5717142
Link To Document