• DocumentCode
    2777039
  • Title

    Tree-based Fitted Q-iteration for Multi-Objective Markov Decision problems

  • Author

    Castelletti, Andrea ; Pianosi, Francesca ; Restelli, Marcello

  • Author_Institution
    Dept. of Electron. & Inf., Politec. di Milano, Milan, Italy
  • fYear
    2012
  • fDate
    10-15 June 2012
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    This paper is about solving multi-objective control problems using a model-free batch-mode reinforcement-learning approach. Although many real-world applications have several conflicting objectives, reinforcement-learning (RL) literature has mainly focused on single-objective control problems. As a consequence, in the presence of multiple objectives, the usual approach is to consider many single-objective control problems (resulting from different combinations of the original problem objectives), each one solved using standard RL techniques. The algorithm proposed in this paper is an extension of Fitted Q-iteration (FQI) that enables to learn the control policies for all the linear combinations of preferences (weights) assigned to the objectives in a single training process. The key idea of multi-objective FQI (MOFQI) is to enlarge the continuous approximation of the action-value function, which is performed by single-objective FQI over the state-action space, also to the weight space. The approach is demonstrated on an interesting real-world application for multi-objective RL algorithms: the optimal operation of a multi-purpose water reservoir.
  • Keywords
    Markov processes; decision making; iterative methods; learning systems; optimal control; reservoirs; trees (mathematics); MOFQI; model-free batch-mode reinforcement-learning approach; multiobjective FQI; multiobjective Markov decision problems; multiobjective control problems; multipurpose water reservoir; optimal operation; single-objective FQI; single-objective control problems; state-action space; tree-based fitted Q-iteration; Aerospace electronics; Approximation algorithms; Approximation methods; Reservoirs; Training; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2012 International Joint Conference on
  • Conference_Location
    Brisbane, QLD
  • ISSN
    2161-4393
  • Print_ISBN
    978-1-4673-1488-6
  • Electronic_ISBN
    2161-4393
  • Type

    conf

  • DOI
    10.1109/IJCNN.2012.6252759
  • Filename
    6252759