DocumentCode
831968
Title
Algorithms for singularly perturbed limiting average Markov control problems
Author
Abbad, Mohammed ; Filar, Jerzy A. ; Bielecki, Tomasz R.
Author_Institution
Dept. of Math. & Stat., Maryland Univ., Baltimore, MD, USA
Volume
37
Issue
9
fYear
1992
fDate
9/1/1992 12:00:00 AM
Firstpage
1421
Lastpage
1425
Abstract
A singularly perturbed Markov decision process with the limiting average reward criterion is considered. It is assumed that the underlying process is composed of n separate irreducible processes, and that the small perturbation is such that it unites these processes into a single irreducible process. Two algorithms for the solution of the underlying limit Markov control problem are presented. The first of these is a linear program possessing the Wolfe-Dantzig structure inherited from the ergodic `nearly decomposable´ assumption in the model. The second is an aggregation-disaggregation policy improvement algorithm
Keywords
Markov processes; decision theory; linear programming; perturbation techniques; Wolfe-Dantzig structure; aggregation-disaggregation policy improvement algorithm; irreducible processes; limit Markov control problem; limiting average reward criterion; linear program; singularly perturbed Markov decision process; singularly perturbed limiting average Markov control problems; Mathematics; Probability; State-space methods; Statistics;
fLanguage
English
Journal_Title
Automatic Control, IEEE Transactions on
Publisher
ieee
ISSN
0018-9286
Type
jour
DOI
10.1109/9.159585
Filename
159585
Link To Document