Title :
Tree-based Fitted Q-iteration for Multi-Objective Markov Decision problems
Author :
Castelletti, Andrea ; Pianosi, Francesca ; Restelli, Marcello
Author_Institution :
Dept. of Electron. & Inf., Politec. di Milano, Milan, Italy
Abstract :
This paper is about solving multi-objective control problems using a model-free batch-mode reinforcement-learning approach. Although many real-world applications have several conflicting objectives, reinforcement-learning (RL) literature has mainly focused on single-objective control problems. As a consequence, in the presence of multiple objectives, the usual approach is to consider many single-objective control problems (resulting from different combinations of the original problem objectives), each one solved using standard RL techniques. The algorithm proposed in this paper is an extension of Fitted Q-iteration (FQI) that enables to learn the control policies for all the linear combinations of preferences (weights) assigned to the objectives in a single training process. The key idea of multi-objective FQI (MOFQI) is to enlarge the continuous approximation of the action-value function, which is performed by single-objective FQI over the state-action space, also to the weight space. The approach is demonstrated on an interesting real-world application for multi-objective RL algorithms: the optimal operation of a multi-purpose water reservoir.
Keywords :
Markov processes; decision making; iterative methods; learning systems; optimal control; reservoirs; trees (mathematics); MOFQI; model-free batch-mode reinforcement-learning approach; multiobjective FQI; multiobjective Markov decision problems; multiobjective control problems; multipurpose water reservoir; optimal operation; single-objective FQI; single-objective control problems; state-action space; tree-based fitted Q-iteration; Aerospace electronics; Approximation algorithms; Approximation methods; Reservoirs; Training; Vectors;
Conference_Titel :
Neural Networks (IJCNN), The 2012 International Joint Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-1488-6
Electronic_ISBN :
2161-4393
DOI :
10.1109/IJCNN.2012.6252759