DocumentCode :
404052
Title :
An asymptotically efficient algorithm for finite horizon stochastic dynamic programming problems
Author :
Chang, Hyeong Soo ; Fu, Michael C. ; Marcus, Steven I.
Author_Institution :
Dept. of Comput. Sci. & Eng., Sogang Univ., Seoul, South Korea
Volume :
4
fYear :
2003
fDate :
9-12 Dec. 2003
Firstpage :
3818
Abstract :
We present a novel algorithm, called "Simulated Annealing Multiplicative Weights", for approximately solving large (discrete-time) finite-horizon stochastic dynamic programming problems. The algorithm is "asymptotically efficient" in the sense that a finite-time bound for the sample mean of the optimal value function over a given finite policy space can be obtained, and the bound approaches the optimal value as the number of iterations increases. The algorithm updates a probability distribution over the given policy space with a very simple rule, and the sequence of distributions generated by the algorithm converges to a distribution concentrated only on the optimal policies for the given policy space. We also discuss how to reduce the computational cost of the algorithm to apply it in practice.
Keywords :
discrete time systems; dynamic programming; infinite horizon; probability; simulated annealing; stochastic programming; asymptotically efficient algorithm; computational cost reduction; discrete time system; finite horizon; finite policy space; finite time bound; iterations; optimal policies; optimal value function; probability distribution; simulated annealing multiplicative weights; stochastic dynamic programming; Annealing; Computational efficiency; Computer science; Control systems; Dynamic programming; Educational institutions; Probability distribution; Random variables; Stochastic processes; Uncertainty;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control, 2003. Proceedings. 42nd IEEE Conference on
ISSN :
0191-2216
Print_ISBN :
0-7803-7924-1
Type :
conf
DOI :
10.1109/CDC.2003.1271744
Filename :
1271744
Link To Document :
بازگشت