مرکز منطقه ای اطلاع رساني علوم و فناوري - An asymptotically efficient algorithm for finite horizon stochastic dynamic programming problems

DocumentCode :

404052

Title :

An asymptotically efficient algorithm for finite horizon stochastic dynamic programming problems

Author :

Chang, Hyeong Soo ; Fu, Michael C. ; Marcus, Steven I.

Author_Institution :

Dept. of Comput. Sci. & Eng., Sogang Univ., Seoul, South Korea

Volume :

fYear :

2003

fDate :

9-12 Dec. 2003

Firstpage :

3818

Abstract :

We present a novel algorithm, called "Simulated Annealing Multiplicative Weights", for approximately solving large (discrete-time) finite-horizon stochastic dynamic programming problems. The algorithm is "asymptotically efficient" in the sense that a finite-time bound for the sample mean of the optimal value function over a given finite policy space can be obtained, and the bound approaches the optimal value as the number of iterations increases. The algorithm updates a probability distribution over the given policy space with a very simple rule, and the sequence of distributions generated by the algorithm converges to a distribution concentrated only on the optimal policies for the given policy space. We also discuss how to reduce the computational cost of the algorithm to apply it in practice.

Keywords :

discrete time systems; dynamic programming; infinite horizon; probability; simulated annealing; stochastic programming; asymptotically efficient algorithm; computational cost reduction; discrete time system; finite horizon; finite policy space; finite time bound; iterations; optimal policies; optimal value function; probability distribution; simulated annealing multiplicative weights; stochastic dynamic programming; Annealing; Computational efficiency; Computer science; Control systems; Dynamic programming; Educational institutions; Probability distribution; Random variables; Stochastic processes; Uncertainty;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control, 2003. Proceedings. 42nd IEEE Conference on

ISSN :

0191-2216

Print_ISBN :

0-7803-7924-1

Type :

conf

DOI :

10.1109/CDC.2003.1271744

Filename :

1271744

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=404052