DocumentCode
2110342
Title
A Tensor Factorization Approach to Generalization in Multi-agent Reinforcement Learning
Author
Bromuri, S.
Author_Institution
Dept. of Bus. Inf. Syst., Univ. of Appl. Sci. Western Switzerland, Sierre, Switzerland
Volume
2
fYear
2012
fDate
4-7 Dec. 2012
Firstpage
274
Lastpage
281
Abstract
Reinforcement learning (RL) and multi-agent reinforcement learning (MARL) are disciplines concerned with defining automatically the behaviour of an agent, or a set of interacting agents, by means of reward mechanisms coming from the environment. An important research issue in the context of RL and MARL is the definition of approaches to combine the knowledge of multiple learning agents to improve the overall performance of the multi-agent system (MAS). This paper illustrates how to improve RL and MARL algorithms by utilizing results from multi-linear algebra such as tensors and tensor factorizations. In particular, the focus is on showing how to modify existing algorithms from literature to include a tensor factorization step applied to the Q-Tables learned by the individual agents to generalize the knowledge about the actions performed in the environment. The modified algorithms are then evaluated in three RL and MARL scenarios against their unmodified version to show the benefits of the tensor factorization step.
Keywords
generalisation (artificial intelligence); learning (artificial intelligence); multi-agent systems; tensors; MARL algorithms; MAS; Q-tables; generalization; interacting agents; modified algorithms; multiagent reinforcement learning; multiagent system; multilinear algebra; multiple learning agents; reward mechanisms; tensor factorization approach; tensor factorizations; tensors; Algorithms; Dynamic Programming; Learning Systems; Software Agents;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on
Conference_Location
Macau
Print_ISBN
978-1-4673-6057-9
Type
conf
DOI
10.1109/WI-IAT.2012.21
Filename
6511581
Link To Document