Title :
Decentralized learning in multiple pursuer-evader Markov games
Author :
Givigi, Sidney ; Schwartz, Howard M.
Author_Institution :
Dept. of Electr. & Comput. Eng., R. Mil. Coll., Kingston, ON, Canada
Abstract :
We represent the multiple pursuers and evaders game as a Markov game and each player as a decentralized unit that has to work independently in order to complete a task. Most proposed solutions for this distributed multiagent decision problem require some sort of central coordination. In this paper, we intend to model each player as a learning automata (LA) and let them evolve and adapt in order to solve the difficult problem they have at hand. We are also going to show that using the proposed learning process, the players´ policies will converge to an equilibrium point. Simulations of such scenarios with multiple pursuers and evaders are presented in order to show the feasibility of the approach.
Keywords :
Markov processes; game theory; learning (artificial intelligence); learning automata; multi-agent systems; multivariable systems; decentralized learning; distributed multiagent decision problem; learning automata; multiple pursuer-evader Markov games; Algorithm design and analysis; Convergence; Equations; Games; Markov processes; Mathematical model; Nash equilibrium;
Conference_Titel :
Control & Automation (MED), 2011 19th Mediterranean Conference on
Conference_Location :
Corfu
Print_ISBN :
978-1-4577-0124-5
DOI :
10.1109/MED.2011.5983135