DocumentCode
1219672
Title
Multitime scale Markov decision processes
Author
Chang, Hyeong Soo ; Fard, Pedram Jaefari ; Marcus, Steven I. ; Shayman, Mark
Author_Institution
Dept. of Comput. Sci. & Eng., Sogang Univ., Seoul, South Korea
Volume
48
Issue
6
fYear
2003
fDate
6/1/2003 12:00:00 AM
Firstpage
976
Lastpage
987
Abstract
This paper proposes a simple analytical model called M time scale Markov decision process (MMDPs) for hierarchically structured sequential decision making processes, where decisions in each level in the M-level hierarchy are made in M different discrete time scales. In this model, the state-space and the control-space of each level in the hierarchy are nonoverlapping with those of the other levels, respectively, and the hierarchy is structured in a "pyramid" sense such that a decision made at level m (slower time scale) state and/or the state will affect the evolutionary decision making process of the lower level m+1 (faster time scale) until a new decision is made at the higher level but the lower level decisions themselves do not affect the transition dynamics of higher levels. The performance produced by the lower level decisions will affect the higher level decisions. A hierarchical objective function is defined such that the finite-horizon value of following a (nonstationary) policy at level m+1 over a decision epoch of level m plus an immediate reward at level m is the single-step reward for the decision making process at level m. From this we define "multi-level optimal value function" and derive "multi-level optimality equation." We discuss how to solve MMDPs exactly and study some approximation methods, along with heuristic sampling-based schemes, to solve MMDPs.
Keywords
Markov processes; decision theory; hierarchical systems; state-space methods; M time scale Markov decision process; analytical model; control-space method; hierarchical objective function; hierarchically structured sequential decision making processes; state-space method; Analytical models; Approximation methods; Computer science; Context modeling; Decision making; Equations; Process control;
fLanguage
English
Journal_Title
Automatic Control, IEEE Transactions on
Publisher
ieee
ISSN
0018-9286
Type
jour
DOI
10.1109/TAC.2003.812782
Filename
1205191
Link To Document