DocumentCode
423961
Title
A solving method for MDPs by minimizing variational free energy
Author
Yoshimoto, Junichiro ; Ishii, Shin
Author_Institution
CREST, Japan Sci. & Technol. Agency, Japan
Volume
3
fYear
2004
fDate
25-29 July 2004
Firstpage
1817
Abstract
We propose a novel approach to acquire the optimal policy for a continuous Markov decision process. Based on an analogy from statistical mechanics, we introduce a variational free energy over a policy. A good policy can be obtained by minimizing the variational free energy. According to our approach, the optimal policy in linear quadratic regulator problems can be obtained by using Kalman filtering and smoothing techniques. Even in non-linear problems, a semi-optimal policy can be obtained by Monte Carlo technique with a Gaussian process method.
Keywords
Gaussian processes; Kalman filters; Markov processes; Monte Carlo methods; decision theory; linear quadratic control; minimisation; smoothing methods; statistical mechanics; Gaussian process method; Kalman filtering; Monte Carlo technique; continuous Markov decision process; linear quadratic regulator; nonlinear problems; semioptimal policy; smoothing techniques; statistical mechanics; variational free energy minimization; Cost function; Decision making; Filtering; Gradient methods; Kalman filters; Nonlinear filters; Probability distribution; Regulators; Smoothing methods; Stochastic processes;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
ISSN
1098-7576
Print_ISBN
0-7803-8359-1
Type
conf
DOI
10.1109/IJCNN.2004.1380884
Filename
1380884
Link To Document