مرکز منطقه ای اطلاع رساني علوم و فناوري - Decentralized learning in multiple pursuer-evader Markov games

DocumentCode :

2743088

Title :

Decentralized learning in multiple pursuer-evader Markov games

Author :

Givigi, Sidney ; Schwartz, Howard M.

Author_Institution :

Dept. of Electr. & Comput. Eng., R. Mil. Coll., Kingston, ON, Canada

fYear :

2011

fDate :

20-23 June 2011

Firstpage :

1379

Lastpage :

1385

Abstract :

We represent the multiple pursuers and evaders game as a Markov game and each player as a decentralized unit that has to work independently in order to complete a task. Most proposed solutions for this distributed multiagent decision problem require some sort of central coordination. In this paper, we intend to model each player as a learning automata (LA) and let them evolve and adapt in order to solve the difficult problem they have at hand. We are also going to show that using the proposed learning process, the players´ policies will converge to an equilibrium point. Simulations of such scenarios with multiple pursuers and evaders are presented in order to show the feasibility of the approach.

Keywords :

Markov processes; game theory; learning (artificial intelligence); learning automata; multi-agent systems; multivariable systems; decentralized learning; distributed multiagent decision problem; learning automata; multiple pursuer-evader Markov games; Algorithm design and analysis; Convergence; Equations; Games; Markov processes; Mathematical model; Nash equilibrium;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Control & Automation (MED), 2011 19th Mediterranean Conference on

Conference_Location :

Corfu

Print_ISBN :

978-1-4577-0124-5

Type :

conf

DOI :

10.1109/MED.2011.5983135

Filename :

5983135

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2743088