مرکز منطقه ای اطلاع رساني علوم و فناوري - Multi-Agent Task Assignment in the Bandit Framework

DocumentCode :

2473146

Title :

Multi-Agent Task Assignment in the Bandit Framework

Author :

Ny, Jerome Le ; Dahleh, Munther ; Feron, Eric

Author_Institution :

Lab. for Inf. & Decision Syst., Massachusetts Inst. of Technol., Cambridge, MA

fYear :

2006

fDate :

13-15 Dec. 2006

Firstpage :

5281

Lastpage :

5286

Abstract :

We consider a task assignment problem for a fleet of UAVs in a surveillance/search mission. We formulate the problem as a restless bandits problem with switching costs and discounted rewards: there are TV sites to inspect, each one of them evolving as a Markov chain, with different transition probabilities if the site is inspected or not. The sites evolve independently of each other, there are transition costs c _ij for moving between sites i and j isin {1,..., N}, rewards when visiting the sites, and we maximize a mixed objective function of these costs and rewards. This problem is known to be PSPACE-hard. We present a systematic method, inspired from the work of Bertsimas and Nino-Mora (2000) on restless bandits, for deriving a linear programming relaxation for such locally decomposable MDPs. The relaxation is computable in polynomial-time offline, provides a bound on the achievable performance, as well as an approximation of the cost-to-go which can be used online in conjunction with standard suboptimal stochastic control methods. In particular, the one-step lookahead policy based on this approximate cost-to-go reduces to computing the optimal value of a linear assignment problem of size N. We present numerical experiments, for which we assess the quality of the heuristics using the performance bound

Keywords :

Markov processes; aerospace robotics; heuristic programming; linear programming; mobile robots; multi-robot systems; relaxation theory; remotely operated vehicles; surveillance; Markov chain; Markov decision processes; PSPACE-hard; UAV fleet; bandit framework; cost-to-go; linear assignment problem; linear programming relaxation; lookahead policy; multiagent task assignment; polynomial-time offline; restless bandits problem; suboptimal stochastic control; transition probability; umanned aerial vehicle; Collaboration; Cost function; Frequency measurement; History; Linear programming; Polynomials; Stochastic processes; Surveillance; USA Councils; Unmanned aerial vehicles;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control, 2006 45th IEEE Conference on

Conference_Location :

San Diego, CA

Print_ISBN :

1-4244-0171-2

Type :

conf

DOI :

10.1109/CDC.2006.377612

Filename :

4177503

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2473146