DocumentCode :
31488
Title :
Folding Algorithm for Policy Evaluation for Markov Decision Processes With Quasi-Birth Death Structure
Author :
Yassir, Yassir ; White, Langford B.
Author_Institution :
Sch. of Electr. & Electron. Eng., Univ. of Adelaide, Adelaide, SA, Australia
Volume :
60
Issue :
4
fYear :
2015
fDate :
Apr-15
Firstpage :
1164
Lastpage :
1168
Abstract :
This technical note presents a new numerical procedure for policy evaluation of Stochastic Shortest Path Markov Decision Processes (MDPs) having a level independent Quasi Birth-Death structure. The algorithm is derived using a method analogous to the folding method of Ye and Li (1994). The computational complexity is O(M3log2N) + O(M2N), where the process has N levels and M phases. A simple example involving the control of two queues is presented to illustrate the application of this efficient policy evaluation algorithm to compare and rank control policies.
Keywords :
Markov processes; computational complexity; dynamic programming; matrix algebra; probability; queueing theory; MDP; computational complexity; control policies; discrete state Markov process; dynamic programming; folding algorithm; level independent quasi birth-death structure; numerical procedure; policy evaluation algorithm; queue control; queueing analysis; stochastic shortest path Markov decision processes; transition probability matrix; Markov processes; Process control; Standards; Switches; Vectors; Yttrium; Dynamic programming; optimisation; queueing analysis;
fLanguage :
English
Journal_Title :
Automatic Control, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9286
Type :
jour
DOI :
10.1109/TAC.2014.2348803
Filename :
6879450
Link To Document :
بازگشت