Title :
Decentralized Learning in Markov Games
Author :
Vrancx, Peter ; Verbeeck, Katja ; Nowé, Ann
Author_Institution :
Comput. Modeling Lab., Vrije Univ. Brussel, Brussels
Abstract :
Learning automata (LA) were recently shown to be valuable tools for designing multiagent reinforcement learning algorithms. One of the principal contributions of the LA theory is that a set of decentralized independent LA is able to control a finite Markov chain with unknown transition probabilities and rewards. In this paper, we propose to extend this algorithm to Markov games-a straightforward extension of single-agent Markov decision problems to distributed multiagent decision problems. We show that under the same ergodic assumptions of the original theorem, the extended algorithm will converge to a pure equilibrium point between agent policies.
Keywords :
Markov processes; decentralised control; distributed algorithms; game theory; learning (artificial intelligence); learning automata; multi-agent systems; multivariable systems; Markov decision problem; Markov games; agent policy; decentralized learning automata; distributed multiagent decision problem; finite Markov chain; multiagent reinforcement learning; Game theory; multi-agent systems; reinforcement learning; stochastic automata; stochastic games; Algorithms; Artificial Intelligence; Computer Simulation; Game Theory; Markov Chains; Models, Theoretical;
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
DOI :
10.1109/TSMCB.2008.920998