Title :
MDP optimal control under temporal logic constraints
Author :
Ding, Xu Chu ; Smith, Stephen L. ; Belta, Calin ; Rus, Daniela
Author_Institution :
Dept. of Mech. Eng., Boston Univ., Boston, MA, USA
Abstract :
In this paper, we develop a method to automatically generate a control policy for a dynamical system modeled as a Markov Decision Process (MDP). The control specification is given as a Linear Temporal Logic (LTL) formula over a set of propositions defined on the states of the MDP. We synthesize a control policy such that the MDP satisfies the given specification almost surely, if such a policy exists. In addition, we designate an “optimizing proposition” to be repeatedly satisfied, and we formulate a novel optimization criterion in terms of minimizing the expected cost in between satisfactions of this proposition. We propose a sufficient condition for a policy to be optimal, and develop a dynamic programming algorithm that synthesizes a policy that is optimal under some conditions, and sub-optimal otherwise. This problem is motivated by robotic applications requiring persistent tasks, such as environmental monitoring or data gathering, to be performed.
Keywords :
Markov processes; control engineering computing; control system synthesis; dynamic programming; optimal control; robots; temporal logic; Markov decision process optimal control; control policy; data gathering; dynamic programming algorithm; dynamical system; environmental monitoring; linear temporal logic formula; optimization criterion; optimizing proposition; robotic applications; Dynamic programming; Equations; Markov processes; Probabilistic logic; Strontium; Transient analysis; Vectors;
Conference_Titel :
Decision and Control and European Control Conference (CDC-ECC), 2011 50th IEEE Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-61284-800-6
Electronic_ISBN :
0743-1546
DOI :
10.1109/CDC.2011.6161122