DocumentCode :
3168771
Title :
Multi-agent persistent monitoring in stochastic environments with temporal logic constraints
Author :
Yushan Chen ; Kun Deng ; Belta, Calin
Author_Institution :
Boston Univ., Boston, MA, USA
fYear :
2012
fDate :
10-13 Dec. 2012
Firstpage :
2801
Lastpage :
2806
Abstract :
In this paper, we consider the problem of generating control policies for a team of robots moving in an environment containing elements with probabilistic behaviors. The team is required to achieve an optimal surveillance mission, in which a certain proposition needs to be satisfied infinitely often. The goal is to minimize the average time between satisfying instances of the proposition, while ensuring that the mission is accomplished. By modeling the robots as Transition Systems and the environmental elements as Markov Chains, the problem reduces to finding an optimal control policy satisfying a temporal logic specification on a Markov Decision Process. The existing approaches for this problem are computational intensive and therefore not feasible for a large environment or a large number of robots. To address this issue, we propose an approximate dynamic programming framework. Specifically, we choose a set of basis functions to approximate the optimal cost and find the best parameters for these functions based on the least-square approximation. We develop an approximate policy iteration algorithm to implement our framework. We provide illustrative case studies and evaluate our method through simulations.
Keywords :
Markov processes; decision making; dynamic programming; least squares approximations; multi-agent systems; multi-robot systems; optimal control; probability; temporal logic; Markov chains; Markov decision process; approximate dynamic programming framework; control policies; least-square approximation; multiagent persistent monitoring; optimal control policy; optimal surveillance mission; probabilistic behaviors; robots team; stochastic environments; temporal logic constraints; transition systems; Approximation methods; Clocks; Games; Markov processes; Monitoring; Robots; Silicon;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control (CDC), 2012 IEEE 51st Annual Conference on
Conference_Location :
Maui, HI
ISSN :
0743-1546
Print_ISBN :
978-1-4673-2065-8
Electronic_ISBN :
0743-1546
Type :
conf
DOI :
10.1109/CDC.2012.6426280
Filename :
6426280
Link To Document :
بازگشت