DocumentCode :
1408879
Title :
Markov Decisions on a Partitioned State Space
Author :
Smith, John L.
Author_Institution :
Division of Computing Research, Commonwealth Scientific and Industrial Research Organization, Canberra, Australian Capital Territory, Australia.
Issue :
1
fYear :
1971
Firstpage :
55
Lastpage :
60
Abstract :
An important practical constraint on admissible control policies is defined for the Markov decision process. The framework of an algorithm based on the infinite return optimization algorithms of Howard and Jewell is suggested to compute the optimal policy under this constraint. Iterative convergence to the optimal policy cannot be guaranteed, but techniques proposed for state-space reduction and rapid resolution of undetermined policies should render many problems tractable.
Keywords :
Australia; Control systems; Convergence; Cost function; Iterative algorithms; Mathematical model; Optimal control; Partitioning algorithms; State-space methods; Stochastic systems;
fLanguage :
English
Journal_Title :
Systems, Man and Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9472
Type :
jour
DOI :
10.1109/TSMC.1971.5408604
Filename :
5408604
Link To Document :
بازگشت