• DocumentCode
    189160
  • Title

    Continuous Real Time Dynamic Programming for Discrete and Continuous State MDPs

  • Author

    Rocha Vianna, Luis Gustavo ; Sanner, Scott ; Nunes de Barros, Leliane

  • Author_Institution
    Univ. of Sao Paulo, Sao Paulo, Brazil
  • fYear
    2014
  • fDate
    18-22 Oct. 2014
  • Firstpage
    134
  • Lastpage
    139
  • Abstract
    Applications of automated planning under uncertainty are often modelled as a discrete and continuous state Markov Decision Process (DC-MDP). Symbolic Dynamic Programming is the existent exact solution for DC-MDPs that uses the eXtended Algebraic Decision Diagrams (XADDs) to symbolically represent the state value function and that computes a complete state-space policy (which is very costly and limits solution to problems with small size and depth). Real-Time Dynamic Programming (RTDP) is an efficient solution method for discrete state MDPs that provides a partial solution for a known initial state. In this paper we combine the RTDP solution with XADD symbolic representation and computation of the value function to propose the Continuous Real Time Dynamic Programming (CRTDP) algorithm. This novel planner uses heuristic search and symbolic generalisation to efficiently update the value function by regions. We show that using the initial state information greatly reduces the number of regions in the value function, therefore allowing CRTDP to solve DC-MDPs more efficiently than standard symbolic dynamic programming both in time and space required for the solution.
  • Keywords
    Markov processes; decision support systems; decision theory; dynamic programming; real-time systems; CRTDP algorithm; DC-MDP; RTDP solution; XADD symbolic representation; automated planning; continuous real time dynamic programming; continuous state MDP; continuous state Markov decision process; discrete state MDP; existent exact solution; extended algebraic decision diagrams; heuristic search; standard symbolic dynamic programming; state space policy; state value function; symbolic generalisation; Aerospace electronics; Dynamic programming; Heuristic algorithms; Inventory control; Mars; Planning; Real-time systems; Continuous RTDP; Continuous State Space Planning; Discrete and Continuous State MDPs; Markov Decision Process; Symbolic Dynamic Programming;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems (BRACIS), 2014 Brazilian Conference on
  • Conference_Location
    Sao Paulo
  • Type

    conf

  • DOI
    10.1109/BRACIS.2014.34
  • Filename
    6984820