Title :
An algorithm for cooperative probabilistic control design
Author_Institution :
INESC-ID Lisboa, Evora Univ., Lisbon, Portugal
Abstract :
This paper deals with the decentralized closed loop control in a pure probabilistic framework. In this framework, a system is a controlled Markov chain whose transition probabilities depend on the actions of the agents. The agents are also described in a probabilistic way. The objective is to drive the system so that the joint state and agents actions are close to a set of given target probability distributions. The Kullback-Leibler divergence is used as a performance measure. The resulting algorithm uses dynamic programming interleaved with an iterative process that computes the behavior of each agent.
Keywords :
Markov processes; control system synthesis; distributed control; dynamic programming; iterative methods; multi-agent systems; statistical distributions; Kullback-Leibler divergence; agents actions; controlled Markov chain; cooperative probabilistic control design; decentralized closed loop control; dynamic programming; iterative process; probabilistic framework; target probability distributions; transition probabilities; Algorithm design and analysis; Control design; Games; Joints; Markov processes; Probabilistic logic;
Conference_Titel :
Control & Automation (MED), 2012 20th Mediterranean Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4673-2530-1
Electronic_ISBN :
978-1-4673-2529-5
DOI :
10.1109/MED.2012.6265795