Title :
A distributed joint-learning and auction algorithm for target assignment
Author :
Sadikhov, Teymur ; Zhu, Minghui ; Martínez, Sonia
Author_Institution :
Sch. of Aerosp. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Abstract :
We consider an agent-target assignment problem in an unknown environment modeled as an undirected graph. Agents incur cost or reward while traveling on the edges of this graph. Agents do not know the graph or the locations of the targets on it. However, they can obtain local information about these by local sensing and communicating with other agents within a limited range. To solve this problem, we come up with a new distributed algorithm that integrates Q-Learning and a distributed auction. The Q-Learning part helps estimate the assignment benefits calculated by summing up rewards over the graph edges for each agent-target pair, while the auction part takes care of assigning agents to targets in a distributed fashion. The algorithm is shown to terminate with a near-optimal assignment in a finite time. Optimality refers to the assignment benefit maximization, which can depend on a target-agent pair value, and the routing cost of the agent to visit the target.
Keywords :
distributed algorithms; graph theory; learning (artificial intelligence); Q-learning; agent target assignment problem; agent-target pair; assignment benefit maximization; auction algorithm; distributed algorithm; distributed auction; distributed fashion; distributed joint learning; finite time; graph edges; local information; local sensing; near-optimal assignment; optimality; target-agent pair value; undirected graph; unknown environment; Algorithm design and analysis; Convergence; Lead; Robustness; Routing; Sensors; Vehicles;
Conference_Titel :
Decision and Control (CDC), 2010 49th IEEE Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4244-7745-6
DOI :
10.1109/CDC.2010.5718180