DocumentCode :
582608
Title :
Best response learning based on Gaussian regression for multi-agent systems in continuous spaces
Author :
Haijun, Wei ; Xin, Chen ; Min, Wu ; Weihua, Cao
Author_Institution :
Sch. of Inf. Sci. & Eng., Central South Univ., Changsha, China
fYear :
2012
fDate :
25-27 July 2012
Firstpage :
6196
Lastpage :
6201
Abstract :
In the implementations of multi-agent systems, generalization is always viewed as one of the key issues before multi-agent reinforcement learning algorithms are applicable to continuous environments. The paper proposes a best response learning based on Gaussian regression for multi-agent systems in continuous spaces. With a new Q value with reduced dimension defined, the algorithm entitles agent to learning strategy adapting to others´ behaviors. To realize generalization, probabilistic model of state transition in the algorithm is constructed by using Gaussian regression, so that dynamic programming can be applied directly to generate the best response strategy. And both Q-function model and V-function model are built real time in order to generalize state and action spaces. Thus the learning agent is able to tracking partners´ strategies. In the simulation of Double-cart-pole, which is a typical coordinated control problem, even if dynamics is unknown in priori, the algorithm enables agent to learn coordinated strategy, and realize generalization of state space as well.
Keywords :
Gaussian processes; learning (artificial intelligence); multi-agent systems; regression analysis; Gaussian regression; Q value; Q-function model; V-function model; best response learning; continuous spaces; coordinated control problem; double-cart-pole; multiagent reinforcement learning algorithms; multiagent systems; reduced dimension; Decision support systems; Continues Spaces; Gaussian Regression; Multi-agent systems; Reinforcement learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control Conference (CCC), 2012 31st Chinese
Conference_Location :
Hefei
ISSN :
1934-1768
Print_ISBN :
978-1-4673-2581-3
Type :
conf
Filename :
6391027
Link To Document :
بازگشت