DocumentCode
1797821
Title
A novel adaptive weight selection algorithm for multi-objective multi-agent reinforcement learning
Author
Van Moffaert, K. ; Brys, Tim ; Chandra, Aniruddha ; Esterle, Lukas ; Lewis, Peter R. ; Nowe, Ann
Author_Institution
Dept. of Comput. Sci., Vrije Univ. Brussel, Brussels, Belgium
fYear
2014
fDate
6-11 July 2014
Firstpage
2306
Lastpage
2314
Abstract
To solve multi-objective problems, multiple reward signals are often scalarized into a single value and further processed using established single-objective problem solving techniques. While the field of multi-objective optimization has made many advances in applying scalarization techniques to obtain good solution trade-offs, the utility of applying these techniques in the multi-objective multi-agent learning domain has not yet been thoroughly investigated. Agents learn the value of their decisions by linearly scalarizing their reward signals at the local level, while acceptable system wide behaviour results. However, the non-linear relationship between weighting parameters of the scalarization function and the learned policy makes the discovery of system wide trade-offs time consuming. Our first contribution is a thorough analysis of well known scalarization schemes within the multi-objective multi-agent reinforcement learning setup. The analysed approaches intelligently explore the weight-space in order to find a wider range of system trade-offs. In our second contribution, we propose a novel adaptive weight algorithm which interacts with the underlying local multi-objective solvers and allows for a better coverage of the Pareto front. Our third contribution is the experimental validation of our approach by learning bi-objective policies in self-organising smart camera networks. We note that our algorithm (i) explores the objective space faster on many problem instances, (ii) obtained solutions that exhibit a larger hypervolume, while (iii) acquiring a greater spread in the objective space.
Keywords
learning (artificial intelligence); multi-agent systems; optimisation; problem solving; self-adjusting systems; adaptive weight selection algorithm; multiagent reinforcement learning; multiobjective optimization; multiobjective problems; multiple reward signals; self-organising smart camera networks; single-objective problem solving techniques; Algorithm design and analysis; Cameras; Learning (artificial intelligence); Optimization; Search problems; Smart cameras; Space exploration;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4799-6627-1
Type
conf
DOI
10.1109/IJCNN.2014.6889637
Filename
6889637
Link To Document