Title :
Markov Decisions on a Partitioned State Space
Author_Institution :
Division of Computing Research, Commonwealth Scientific and Industrial Research Organization, Canberra, Australian Capital Territory, Australia.
Abstract :
An important practical constraint on admissible control policies is defined for the Markov decision process. The framework of an algorithm based on the infinite return optimization algorithms of Howard and Jewell is suggested to compute the optimal policy under this constraint. Iterative convergence to the optimal policy cannot be guaranteed, but techniques proposed for state-space reduction and rapid resolution of undetermined policies should render many problems tractable.
Keywords :
Australia; Control systems; Convergence; Cost function; Iterative algorithms; Mathematical model; Optimal control; Partitioning algorithms; State-space methods; Stochastic systems;
Journal_Title :
Systems, Man and Cybernetics, IEEE Transactions on
DOI :
10.1109/TSMC.1971.5408604