Title :
Relative entropy and free energy dualities: Connections to Path Integral and KL control
Author :
Theodorou, Evangelos A. ; Todorov, Emo
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Washington, Seattle, WA, USA
Abstract :
This paper integrates recent work on Path Integral (PI) and Kullback-Leibler (KL) divergence stochastic optimal control theory with earlier work on risk sensitivity and the fundamental dualities between free energy and relative entropy. We derive the path integral optimal control framework and its iterative version based on the aforementioned dualities. The resulting formulation of iterative path integral control is valid for general feedback policies and, in contrast to previous work, does not rely on pre-specified policy parameterizations. The derivation is based on successive applications of Girsanov's theorem and the use of the Radon-Nikodým derivative as applied to diffusion processes, due to the change of measure in the stochastic dynamics. We compare the PI control derived from Dynamic Programming with PI control based on the duality between free energy and relative entropy. Moreover, we extend our analysis of the applicability of the relationship between free energy and relative entropy to optimal control of Markov jump diffusion processes. Furthermore, we present the links between KL stochastic optimal control and the aforementioned dualities and discuss its generalizability.
Keywords :
Markov processes; diffusion; dynamic programming; entropy; feedback; free energy; iterative methods; optimal control; risk analysis; stochastic systems; Girsanov theorem; KL stochastic optimal control; Kullback Leibler divergence stochastic optimal control theory; Markov jump diffusion processes; PI control; Radon-Nikodym derivative; free energy dualities; fundamental dualities; general feedback policies; iterative path integral control; iterative version; path integral; path integral divergence stochastic optimal control theory; relative entropy; risk sensitivity; stochastic dynamics; Aerospace electronics; Cost function; Diffusion processes; Trajectory
Conference_Title :
Decision and Control (CDC), 2012 IEEE 51st Annual Conference on
Conference_Location :
Maui, HI
Print_ISBN :
978-1-4673-2065-8
ISSN :
0743-1546
DOI :
10.1109/CDC.2012.6426381