Title :
A distributed algorithm for solving a class of multi-agent Markov decision problems
Author :
Chang, Hyeong Soo ; Fu, Michael C.
Author_Institution :
Dept. of Comput. Sci. & Eng., Sogang Univ., Seoul, South Korea
Abstract :
This paper considers a class of infinite horizon Markov decision processes (MDPs) with multiple decision makers, called agents, and a general joint reward structure, but a special decomposable state/action structure such that each individual agent\´s actions affect the system\´s state transitions independently from the actions of all other agents. We introduce the concept of "localization," where each agent need only consider a "local" MDP defined on its own state and action spaces. Based on this localization concept, we propose an iterative distributed algorithm that emulates gradient ascent and which converges to a locally optimal solution for the average reward case. The solution is an "autonomous" joint policy such that each agent\´s action is based on only its local state.
Keywords :
Markov processes; decision making; distributed algorithms; infinite horizon; iterative methods; multi-agent systems; autonomous joint policy; decomposable state-action structure; infinite horizon; iterative distributed algorithm; localization; multiagent Markov decision processes; multiple decision makers; optimal solution; state transition; Business communication; Computer science; Decision making; Distributed algorithms; Distributed control; Educational institutions; Infinite horizon; Military computing; Protocols; Uncertainty;
Conference_Titel :
Decision and Control, 2003. Proceedings. 42nd IEEE Conference on
Print_ISBN :
0-7803-7924-1
DOI :
10.1109/CDC.2003.1272486