DocumentCode :
2575518
Title :
State aggregation based linear programming approach to approximate dynamic programming
Author :
Darbha, S. ; Krishnamoorthy, K. ; Pachter, M. ; Chandler, P.
Author_Institution :
Dept. of Mech. Eng., Texas A&M Univ., College Station, TX, USA
fYear :
2010
fDate :
15-17 Dec. 2010
Firstpage :
935
Lastpage :
941
Abstract :
One often encounters the curse of dimensionality in the application of dynamic programming to determine optimal policies for controlled Markov chains. In this paper, we provide a method to construct sub-optimal policies along with a bound for the deviation of such a policy from the optimum through the use of restricted linear programming. The novelty of this approach lies in circumventing the need for a value iteration or a linear program defined on the entire state-space. Instead, the state-space is partitioned based on the reward structure and the optimal cost-to-go or value function is approximated by a constant over each partition. We associate a meta-state with each partition, where the transition probabilities between these meta-states can be derived from the original Markov chain specification. The state aggregation approach results in a significant reduction in the computational burden and lends itself to a restricted linear program defined on the aggregated state-space. Finally, the proposed method is bench marked on a perimeter surveillance stochastic control problem.
Keywords :
Markov processes; approximation theory; dynamic programming; iterative methods; linear programming; stochastic systems; controlled Markov chains; dynamic programming; optimal cost-to-go; perimeter surveillance stochastic control problem; restricted linear programming; state aggregation based linear programming; transition probabilities; value function; value iteration; Approximation methods; Delay; Equations; Linear programming; Markov processes; Silicon; Unmanned aerial vehicles;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control (CDC), 2010 49th IEEE Conference on
Conference_Location :
Atlanta, GA
ISSN :
0743-1546
Print_ISBN :
978-1-4244-7745-6
Type :
conf
DOI :
10.1109/CDC.2010.5717627
Filename :
5717627
Link To Document :
بازگشت