DocumentCode :
2717311
Title :
Leader-Follower semi-Markov Decision Problems: Theoretical Framework and Approximate Solution
Author :
Tharakunnel, Kurian ; Bhattacharyya, Siddhartha
Author_Institution :
Dept. of Inf. & Decision Sci., Illinois Univ., Chicago, IL
fYear :
2007
fDate :
1-5 April 2007
Firstpage :
111
Lastpage :
118
Abstract :
Leader-follower problems are hierarchical decision problems in which a leader uses incentives to induce certain desired behavior among a set of self-interested followers. Dynamic leader-follower problems extend this structure to multi-period decision situations. In this work we propose a Markov decision process (MDP) framework for a class of dynamic leader-follower problems that have important applications and discuss their approximate solution using reinforcement learning (RL). In these problems, the leader makes incentive decisions intermittently while the followers make their decisions in every period. Our theoretical framework and computational approach are based on the observation that such dynamic problems can be thought of as consisting of two coupled sequential decision processes, that of the leader and of the followers. In our formulation, the leader´s decision problem that has the structure of a single-agent semi-Markov decision process (SMDP), and the followers´ sequential decision problem structured as a stochastic game (multiagent competitive MDP) operate over the same state space. We call this MDP framework a leader-follower semi-Markov decision process (LFSMDP). We consider approximate solution of these problems using RL and demonstrate the solution approach in the special case where the followers´ stochastic game is a repeated game.
Keywords :
Markov processes; learning (artificial intelligence); stochastic games; Markov decision process; dynamic leader-follower problems; leader-follower semiMarkov decision problems; multiperiod decision situations; reinforcement learning; sequential decision problem; single-agent semiMarkov decision process; stochastic game; theoretical framework; Communication networks; Decision making; Dynamic programming; Electricity supply industry; Game theory; Learning; Peer to peer computing; Pricing; State-space methods; Stochastic processes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0706-0
Type :
conf
DOI :
10.1109/ADPRL.2007.368177
Filename :
4220822
Link To Document :
بازگشت