A Tensor Factorization Approach to Generalization in Multi-agent Reinforcement Learning

Author

Bromuri, S.

Author_Institution

Dept. of Bus. Inf. Syst., Univ. of Appl. Sci. Western Switzerland, Sierre, Switzerland

Volume

2

fYear

2012

fDate

4-7 Dec. 2012

Firstpage

274

Lastpage

281

Abstract

Reinforcement learning (RL) and multi-agent reinforcement learning (MARL) are disciplines concerned with defining automatically the behaviour of an agent, or a set of interacting agents, by means of reward mechanisms coming from the environment. An important research issue in the context of RL and MARL is the definition of approaches to combine the knowledge of multiple learning agents to improve the overall performance of the multi-agent system (MAS). This paper illustrates how to improve RL and MARL algorithms by utilizing results from multi-linear algebra such as tensors and tensor factorizations. In particular, the focus is on showing how to modify existing algorithms from literature to include a tensor factorization step applied to the Q-Tables learned by the individual agents to generalize the knowledge about the actions performed in the environment. The modified algorithms are then evaluated in three RL and MARL scenarios against their unmodified version to show the benefits of the tensor factorization step.

Keywords

generalisation (artificial intelligence); learning (artificial intelligence); multi-agent systems; tensors; MARL algorithms; MAS; Q-tables; generalization; interacting agents; modified algorithms; multiagent reinforcement learning; multiagent system; multilinear algebra; multiple learning agents; reward mechanisms; tensor factorization approach; tensor factorizations; tensors; Algorithms; Dynamic Programming; Learning Systems; Software Agents;

fLanguage

English

Publisher

ieee

Conference_Titel

Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on

Conference_Location

Macau

Print_ISBN

978-1-4673-6057-9

Type

conf

DOI

10.1109/WI-IAT.2012.21

Filename

6511581