Title :
Continuous Real Time Dynamic Programming for Discrete and Continuous State MDPs
Author :
Rocha Vianna, Luis Gustavo ; Sanner, Scott ; Nunes de Barros, Leliane
Author_Institution :
Univ. of Sao Paulo, Sao Paulo, Brazil
Abstract :
Applications of automated planning under uncertainty are often modelled as a discrete and continuous state Markov Decision Process (DC-MDP). Symbolic Dynamic Programming is the existent exact solution for DC-MDPs that uses the eXtended Algebraic Decision Diagrams (XADDs) to symbolically represent the state value function and that computes a complete state-space policy (which is very costly and limits solution to problems with small size and depth). Real-Time Dynamic Programming (RTDP) is an efficient solution method for discrete state MDPs that provides a partial solution for a known initial state. In this paper we combine the RTDP solution with XADD symbolic representation and computation of the value function to propose the Continuous Real Time Dynamic Programming (CRTDP) algorithm. This novel planner uses heuristic search and symbolic generalisation to efficiently update the value function by regions. We show that using the initial state information greatly reduces the number of regions in the value function, therefore allowing CRTDP to solve DC-MDPs more efficiently than standard symbolic dynamic programming both in time and space required for the solution.
Keywords :
Markov processes; decision support systems; decision theory; dynamic programming; real-time systems; CRTDP algorithm; DC-MDP; RTDP solution; XADD symbolic representation; automated planning; continuous real time dynamic programming; continuous state MDP; continuous state Markov decision process; discrete state MDP; existent exact solution; extended algebraic decision diagrams; heuristic search; standard symbolic dynamic programming; state space policy; state value function; symbolic generalisation; Aerospace electronics; Dynamic programming; Heuristic algorithms; Inventory control; Mars; Planning; Real-time systems; Continuous RTDP; Continuous State Space Planning; Discrete and Continuous State MDPs; Markov Decision Process; Symbolic Dynamic Programming;
Conference_Titel :
Intelligent Systems (BRACIS), 2014 Brazilian Conference on
Conference_Location :
Sao Paulo
DOI :
10.1109/BRACIS.2014.34