• DocumentCode
    3266045
  • Title

    A model for multi-time scaled sequential decision making processes

  • Author

    Chang, Hyeong Soo ; Fard, Pedram ; Marcus, Steven I. ; Shayman, Mark A.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Maryland Univ., College Park, MD, USA
  • Volume
    4
  • fYear
    2002
  • fDate
    10-13 Dec. 2002
  • Firstpage
    3813
  • Abstract
    We propose a simple analytical model, called the M time-scale Markov Decision Process (MMDP), for hierarchically structured sequential decision making processes in which the decisions at each level of an M-level hierarchy are made on M different time-scales. In this model, the state space and action space of each level are non-overlapping with those of the other levels, and the hierarchy is structured in a "pyramid" sense: a decision made at level m (slower time-scale) and/or the state at that level affects the evolutionary decision making process of the lower level m+1 (faster time-scale) until a new decision is made at the higher level, but the lower level decisions themselves do not affect the higher level transition dynamics. The performance produced by the lower level decisions does, however, affect the higher level decisions. A hierarchical objective function is defined such that the single-step reward for the level m decision making process is the finite-horizon value of following a (nonstationary) policy at level m+1 over one decision epoch of level m, plus an immediate reward at level m. From this we define a "multi-level optimal value function" and derive a "multi-level optimality equation". We then give some example control problems that can be modeled as MMDPs.
  • Keywords
    Markov processes; decision theory; hierarchical systems; probability; state-space methods; action space; evolutionary decision making process; finite horizon value; hierarchical objective function; hierarchical structure; higher level decisions; higher level transition dynamics; lower level decisions; multiple level optimal value function; multiple level optimality equation; multiple time scaled sequential decision making; nonoverlapping; nonstationary policy; state space; time scale Markov decision process; Analytical models; Context modeling; Contracts; Control systems; Decision making; Educational institutions; Equations; Large-scale systems; Levee; State-space methods;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Decision and Control, 2002, Proceedings of the 41st IEEE Conference on
  • ISSN
    0191-2216
  • Print_ISBN
    0-7803-7516-5
  • Type

    conf

  • DOI
    10.1109/CDC.2002.1184959
  • Filename
    1184959
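
The hierarchical objective described in the abstract can be illustrated with a minimal two-level (M = 2) sketch. It is not the authors' formulation. It assumes, for simplicity, that the fast level restarts from a fixed state `x0` at every slow-level epoch, that each slow-level epoch spans `T` fast-level steps, and that the slow level is solved by discounted value iteration with factor `gamma`. The function names, array layouts, `x0`, `gamma`, and `iters` are all hypothetical choices made for this illustration.

```python
import numpy as np


def lower_level_value(P, R, T):
    """Optimal T-step finite-horizon value of the fast level.

    P has shape (A, n, n) and R has shape (A, n): one transition matrix and
    one reward vector per fast-level action, already conditioned on the
    slow-level state and action in force during the epoch (assumed layout).
    """
    n = P.shape[1]
    v = np.zeros(n)
    for _ in range(T):
        # Backward induction: maximize over fast-level actions at each state.
        v = np.max(R + P @ v, axis=0)
    return v


def upper_level_value(P_up, R_up, lower_models, T, x0, gamma, iters=500):
    """Discounted value iteration at the slow level (illustrative assumption).

    P_up has shape (L, m, m) and R_up has shape (L, m).
    lower_models[i][lam] is the (P, R) pair of the fast level under
    slow-level state i and slow-level action lam.
    """
    n_act, m = R_up.shape
    # Single-step slow-level reward: immediate slow-level reward plus the
    # fast-level finite-horizon value over one slow-level epoch.
    r = np.empty((n_act, m))
    for lam in range(n_act):
        for i in range(m):
            P, R = lower_models[i][lam]
            r[lam, i] = R_up[lam, i] + lower_level_value(P, R, T)[x0]
    V = np.zeros(m)
    for _ in range(iters):
        # Slow-level transitions P_up do not depend on fast-level decisions.
        V = np.max(r + gamma * (P_up @ V), axis=0)
    return V, r
```

In this sketch the fast level influences the slow level only through the composite reward `r`, which mirrors the abstract's statement that lower level performance affects higher level decisions while lower level decisions leave the higher level transition dynamics unchanged.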