DocumentCode :
1413430
Title :
Evolutionary Policy Iteration Under a Sampling Regime for Stochastic Combinatorial Optimization
Author :
Hannah, Lauren A. ; Powell, Warren B.
Author_Institution :
Dept. of Oper. Res. & Financial Eng., Princeton Univ., Princeton, NJ, USA
Volume :
55
Issue :
5
fYear :
2010
fDate :
5/1/2010 12:00:00 AM
Firstpage :
1254
Lastpage :
1257
Abstract :
This article modifies the evolutionary policy selection algorithm of Chang et al., which was designed for use in infinite horizon Markov decision processes (MDPs) with a large action space to a discrete stochastic optimization problem, in an algorithm called Evolutionary Policy Iteration-Monte Carlo (EPI-MC). EPI-MC allows EPI to be used in a stochastic combinatorial optimization setting with a finite action space and a noisy cost (value) function by introducing a sampling schedule. Convergence of EPI-MC to the optimal action is proven and experimental results are given.
Keywords :
Markov processes; Monte Carlo methods; combinatorial mathematics; discrete systems; optimisation; sampling methods; stochastic systems; discrete stochastic optimization; evolutionary policy iteration-Monte Carlo; infinite horizon Markov decision process; sampling regime; stochastic combinatorial optimization; Algorithm design and analysis; Ant colony optimization; Convergence; Cost function; Design optimization; Genetic mutations; Infinite horizon; Monte Carlo methods; Operations research; Sampling methods; State-space methods; Stochastic processes; Combinatorial optimization; Monte Carlo (MC); evolutionary policy iteration (EPI); stochastic optimization;
fLanguage :
English
Journal_Title :
Automatic Control, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9286
Type :
jour
DOI :
10.1109/TAC.2010.2042766
Filename :
5409644
Link To Document :
بازگشت