مرکز منطقه ای اطلاع رساني علوم و فناوري - A distributed algorithm for solving a class of multi-agent Markov decision problems

DocumentCode :

404314

Title :

A distributed algorithm for solving a class of multi-agent Markov decision problems

Author :

Chang, Hyeong Soo ; Fu, Michael C.

Author_Institution :

Dept. of Comput. Sci. & Eng., Sogang Univ., Seoul, South Korea

Volume :

fYear :

2003

fDate :

9-12 Dec. 2003

Firstpage :

5341

Abstract :

This paper considers a class of infinite horizon Markov decision processes (MDPs) with multiple decision makers, called agents, and a general joint reward structure, but a special decomposable state/action structure such that each individual agent\´s actions affect the system\´s state transitions independently from the actions of all other agents. We introduce the concept of "localization," where each agent need only consider a "local" MDP defined on its own state and action spaces. Based on this localization concept, we propose an iterative distributed algorithm that emulates gradient ascent and which converges to a locally optimal solution for the average reward case. The solution is an "autonomous" joint policy such that each agent\´s action is based on only its local state.

Keywords :

Markov processes; decision making; distributed algorithms; infinite horizon; iterative methods; multi-agent systems; autonomous joint policy; decomposable state-action structure; infinite horizon; iterative distributed algorithm; localization; multiagent Markov decision processes; multiple decision makers; optimal solution; state transition; Business communication; Computer science; Decision making; Distributed algorithms; Distributed control; Educational institutions; Infinite horizon; Military computing; Protocols; Uncertainty;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control, 2003. Proceedings. 42nd IEEE Conference on

ISSN :

0191-2216

Print_ISBN :

0-7803-7924-1

Type :

conf

DOI :

10.1109/CDC.2003.1272486

Filename :

1272486

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=404314