Title :
Sampling-based algorithms for continuous-time POMDPs
Author :
Chaudhari, Pratik ; Karaman, Sertac ; Hsu, David ; Frazzoli, Emilio
Author_Institution :
Dept. of Aeronaut. & Astronaut., Massachusetts Inst. of Technol., Cambridge, MA, USA
Abstract :
This paper focuses on a continuous-time, continuous-space formulation of the stochastic optimal control problem with nonlinear dynamics and observation noise. We lay the mathematical foundations to construct, via incremental sampling, an approximating sequence of discrete-time finite-state partially observable Markov decision processes (POMDPs), such that the behavior of successive approximations converges to the behavior of the original continuous system in an appropriate sense. We also show that the optimal cost function and control policies for these POMDP approximations converge almost surely to their counterparts for the underlying continuous system in the limit. We demonstrate this approach on two popular continuous-time problems, viz., the Linear-Quadratic-Gaussian (LQG) control problem and the light-dark domain problem.
Keywords :
Markov processes; approximation theory; continuous time systems; cost optimal control; discrete time systems; linear quadratic Gaussian control; nonlinear control systems; sampling methods; stochastic systems; LQG; continuous-space formulation; continuous-time POMDP; continuous-time formulation; discrete-time finite-state partially observable Markov decision processes; incremental sampling-based algorithms; light-dark domain problem; linear-quadratic-Gaussian control problem; mathematical foundations; nonlinear dynamics; observation noise; optimal cost control policies; optimal cost function policies; sequence approximation; stochastic optimal control problem; Approximation methods; Cost function; Manganese; Markov processes; Tin; Trajectory;
Conference_Titel :
American Control Conference (ACC), 2013
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4799-0177-7
DOI :
10.1109/ACC.2013.6580549