DocumentCode :
1798057
Title :
Multi-objective χ-Armed bandits
Author :
Van Moffaert, K. ; Van Vaerenbergh, Kevin ; Vrancx, Peter ; Nowe, Ann
Author_Institution :
Dept. of Comput. Sci., Vrije Univ. Brussel, Brussels, Belgium
fYear :
2014
fDate :
6-11 July 2014
Firstpage :
2331
Lastpage :
2338
Abstract :
Many of the standard optimization algorithms focus on optimizing a single, scalar feedback signal. However, real-life optimization problems often require a simultaneous optimization of more than one objective. In this paper, we propose a multi-objective extension to the standard χ-armed bandit problem. As the feedback signal is now vector-valued, the goal of the agent is to sample actions in the Pareto dominating area of the objective space. Therefore, we propose the multi-objective Hierarchical Optimistic Optimization strategy that discretizes the continuous action space in relation to the Pareto optimal solutions obtained in the multi-objective objective space. We experimentally validate the approach on two well-known multi-objective test functions and a simulation of a real life application, the filling phase of a wet clutch. We demonstrate that the strategy allows to identify the Pareto front after just a few epochs and to sample accordingly. After learning, several multi-objective quality indicators indicate that the set of sampled solutions by the algorithm very closely approximates the Pareto front.
Keywords :
Pareto optimisation; feedback; Pareto dominating area; Pareto front; Pareto optimal solutions; continuous action space; multiobjective χ-armed bandits; multiobjective extension; multiobjective hierarchical optimistic optimization strategy; multiobjective quality indicators; multiobjective test functions; optimization algorithms; scalar feedback signal; vector-valued feedback signal; Approximation algorithms; Pareto optimization; Pistons; Shafts; Torque; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6627-1
Type :
conf
DOI :
10.1109/IJCNN.2014.6889753
Filename :
6889753
Link To Document :
بازگشت