DocumentCode :
3170853
Title :
Relative entropy and free energy dualities: Connections to Path Integral and KL control
Author :
Theodorou, Evangelos A. ; Todorov, Emo
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Washington, Seattle, WA, USA
fYear :
2012
fDate :
10-13 Dec. 2012
Firstpage :
1466
Lastpage :
1473
Abstract :
This paper integrates recent work on Path Integral (PI) and Kullback Leibler (KL) divergence stochastic optimal control theory with earlier work on risk sensitivity and the fundamental dualities between free energy and relative entropy. We derive the path integral optimal control framework and its iterative version based on the aforemetioned dualities. The resulting formulation of iterative path integral control is valid for general feedback policies and in contrast to previous work, it does not rely on pre-specified policy parameterizations. The derivation is based on successive applications of Girsanov´s theorem and the use of Radon-Nikodým derivative as applied to diffusion processes due to the change of measure in the stochastic dynamics. We compare the PI control derived based on Dynamic Programming with PI based on the duality between free energy and relative entropy. Moreover we extend our analysis on the applicability of the relationship between free energy and relative entropy to optimal control of markov jump diffusions processes. Furthermore, we present the links between KL stochastic optimal control and the aforementioned dualities and discuss its generalizability.
Keywords :
Markov processes; diffusion; dynamic programming; entropy; feedback; free energy; iterative methods; optimal control; risk analysis; stochastic systems; Girsanov theorem; KL stochastic optimal control; Kullback Leibler divergence stochastic optimal control theory; Markov jump diffusion processes; PI control; Radon-Nikodym derivative; dynamic programming; free energy dualities; fundamental dualities; general feedback policies; iterative path integral control; iterative version; optimal control; path integral; path integral divergence stochastic optimal control theory; relative entropy; risk sensitivity; stochastic dynamics; Aerospace electronics; Cost function; Diffusion processes; Entropy; Markov processes; Optimal control; Trajectory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control (CDC), 2012 IEEE 51st Annual Conference on
Conference_Location :
Maui, HI
ISSN :
0743-1546
Print_ISBN :
978-1-4673-2065-8
Electronic_ISBN :
0743-1546
Type :
conf
DOI :
10.1109/CDC.2012.6426381
Filename :
6426381
Link To Document :
بازگشت