Title :
Decentralized Q-learning for weakly acyclic stochastic dynamic games
Author :
Gürdal Arslan;Serdar Yüksel
Author_Institution :
Department of Electrical Engineering, University of Hawaii at Manoa, 440 Holmes Hall, 2540 Dole Street, Honolulu, 96822, USA
Abstract :
There are only a few learning algorithms applicable to stochastic dynamic games. Learning in games is generally difficult because of the non-stationary environment in which each decision maker aims to learn its optimal decisions with minimal information in the presence of the other decision makers who are also learning. In the case of dynamic games, learning is more challenging because, while learning, the decision makers alter the state of the system and hence the future cost. In this paper, we present decentralized Q-learning algorithms for stochastic dynamic games, and study their convergence for the weakly acyclic case. We show that the decision makers employing these algorithms would eventually be using equilibrium policies almost surely in large classes of stochastic dynamic games.
Keywords :
"Games","Heuristic algorithms","Markov processes","Convergence","Cooperative systems","Standards","Aerospace electronics"
Conference_Titel :
Decision and Control (CDC), 2015 IEEE 54th Annual Conference on
DOI :
10.1109/CDC.2015.7403281