مرکز منطقه ای اطلاع رساني علوم و فناوري - Leader-Follower semi-Markov Decision Problems: Theoretical Framework and Approximate Solution

DocumentCode :

2717311

Title :

Leader-Follower semi-Markov Decision Problems: Theoretical Framework and Approximate Solution

Author :

Tharakunnel, Kurian ; Bhattacharyya, Siddhartha

Author_Institution :

Dept. of Inf. & Decision Sci., Illinois Univ., Chicago, IL

fYear :

2007

fDate :

1-5 April 2007

Firstpage :

111

Lastpage :

118

Abstract :

Leader-follower problems are hierarchical decision problems in which a leader uses incentives to induce certain desired behavior among a set of self-interested followers. Dynamic leader-follower problems extend this structure to multi-period decision situations. In this work we propose a Markov decision process (MDP) framework for a class of dynamic leader-follower problems that have important applications and discuss their approximate solution using reinforcement learning (RL). In these problems, the leader makes incentive decisions intermittently while the followers make their decisions in every period. Our theoretical framework and computational approach are based on the observation that such dynamic problems can be thought of as consisting of two coupled sequential decision processes, that of the leader and of the followers. In our formulation, the leader´s decision problem that has the structure of a single-agent semi-Markov decision process (SMDP), and the followers´ sequential decision problem structured as a stochastic game (multiagent competitive MDP) operate over the same state space. We call this MDP framework a leader-follower semi-Markov decision process (LFSMDP). We consider approximate solution of these problems using RL and demonstrate the solution approach in the special case where the followers´ stochastic game is a repeated game.

Keywords :

Markov processes; learning (artificial intelligence); stochastic games; Markov decision process; dynamic leader-follower problems; leader-follower semiMarkov decision problems; multiperiod decision situations; reinforcement learning; sequential decision problem; single-agent semiMarkov decision process; stochastic game; theoretical framework; Communication networks; Decision making; Dynamic programming; Electricity supply industry; Game theory; Learning; Peer to peer computing; Pricing; State-space methods; Stochastic processes;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on

Conference_Location :

Honolulu, HI

Print_ISBN :

1-4244-0706-0

Type :

conf

DOI :

10.1109/ADPRL.2007.368177

Filename :

4220822

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2717311