DocumentCode :
2584386
Title :
A distributed joint-learning and auction algorithm for target assignment
Author :
Sadikhov, Teymur ; Zhu, Minghui ; Martínez, Sonia
Author_Institution :
Sch. of Aerosp. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
fYear :
2010
fDate :
15-17 Dec. 2010
Firstpage :
5450
Lastpage :
5455
Abstract :
We consider an agent-target assignment problem in an unknown environment modeled as an undirected graph. Agents incur a cost or receive a reward when traversing the edges of this graph. They know neither the graph nor the locations of the targets on it, but they can obtain local information about both through local sensing and by communicating with other agents within a limited range. To solve this problem, we propose a new distributed algorithm that integrates Q-learning with a distributed auction. The Q-learning component estimates the assignment benefit of each agent-target pair by summing the rewards over the graph edges, while the auction component assigns agents to targets in a distributed fashion. The algorithm is shown to terminate in finite time with a near-optimal assignment. Optimality refers to maximization of the assignment benefit, which can depend on the value of a target-agent pair and on the routing cost for the agent to reach the target.
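The auction step mentioned in the abstract can be illustrated with a minimal sketch. This is the classical centralized auction algorithm for the assignment problem, not the paper's distributed method. It assumes the benefit matrix is already known, whereas the paper estimates it with Q-learning. The function name auction_assignment, the eps parameter, and the example benefit values are hypothetical and chosen only for illustration.

```python
def auction_assignment(benefit, eps=0.01):
    """Classical (centralized) auction for a square benefit matrix.

    benefit[i][j] is the assumed benefit of assigning agent i to target j.
    Returns (assigned, prices), where assigned[i] is the target held by agent i.
    """
    n = len(benefit)
    prices = [0.0] * n        # current price of each target
    owner = [None] * n        # owner[j]: agent currently holding target j
    assigned = [None] * n     # assigned[i]: target currently held by agent i
    unassigned = list(range(n))

    while unassigned:
        i = unassigned.pop()
        # Net value of every target for agent i at the current prices.
        values = [benefit[i][j] - prices[j] for j in range(n)]
        best = max(range(n), key=values.__getitem__)
        second = max((values[j] for j in range(n) if j != best),
                     default=values[best])
        # Bid: raise the price by the gap to the second-best value plus eps.
        prices[best] += values[best] - second + eps
        # The previous holder of the target, if any, is outbid.
        prev = owner[best]
        if prev is not None:
            assigned[prev] = None
            unassigned.append(prev)
        owner[best] = i
        assigned[i] = best

    return assigned, prices


# Hypothetical 3-agent, 3-target benefit matrix.
benefit = [[10, 5, 1],
           [8, 7, 2],
           [6, 4, 3]]
assigned, prices = auction_assignment(benefit, eps=0.01)
total = sum(benefit[i][assigned[i]] for i in range(len(benefit)))
print(assigned, total)
```

With a positive eps the loop ends after finitely many bids, and the total benefit of the result is within n * eps of the optimum for an n-by-n problem. That standard property of the classical auction corresponds to the "near-optimal assignment in finite time" wording of the abstract.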
Keywords :
distributed algorithms; graph theory; learning (artificial intelligence); Q-learning; agent target assignment problem; agent-target pair; assignment benefit maximization; auction algorithm; distributed algorithm; distributed auction; distributed fashion; distributed joint learning; finite time; graph edges; local information; local sensing; near-optimal assignment; optimality; target-agent pair value; undirected graph; unknown environment; Algorithm design and analysis; Convergence; Lead; Robustness; Routing; Sensors; Vehicles;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Decision and Control (CDC), 2010 49th IEEE Conference on
Conference_Location :
Atlanta, GA
ISSN :
0743-1546
Print_ISBN :
978-1-4244-7745-6
Type :
conf
DOI :
10.1109/CDC.2010.5718180
Filename :
5718180