مرکز منطقه ای اطلاع رساني علوم و فناوري - Unifying Two Views on Multiple Mean-Payoff Objectives in Markov Decision Processes

DocumentCode :

728976

Title :

Unifying Two Views on Multiple Mean-Payoff Objectives in Markov Decision Processes

Author :

Chatterjee, Krishnendu ; Komarkova, Zuzana ; Kretinsky, Jan

Author_Institution :

IST Austria, Klosterneuburg, Austria

fYear :

2015

fDate :

6-10 July 2015

Firstpage :

244

Lastpage :

256

Abstract :

We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives. There exist two different views: (i) the expectation semantics, where the goal is to optimize the expected mean-payoff objective, and (ii) the satisfaction semantics, where the goal is to maximize the probability of runs such that the mean-payoff value stays above a given vector. We consider optimization with respect to both objectives at once, thus unifying the existing semantics. Precisely, the goal is to optimize the expectation while ensuring the satisfaction constraint. Our problem captures the notion of optimization with respect to strategies that are risk-averse (i.e., ensure certain probabilistic guarantee). Our main results are as follows: First, we present algorithms for the decision problems, which are always polynomial in the size of the MDP. We also show that an approximation of the Pareto curve can be computed in time polynomial in the size of the MDP, and the approximation factor, but exponential in the number of dimensions. Second, we present a complete characterization of the strategy complexity (in terms of memory bounds and randomization) required to solve our problem.

Keywords :

Markov processes; Pareto optimisation; computational complexity; decision theory; probability; MDP; Markov decision processes; Pareto curve approximation; approximation factor; decision problem; expectation semantics; expected mean-payoff objective optimization; mean-payoff value; memory bound; multiple limit-average objectives; multiple mean-payoff objectives; randomization; run probability maximization; satisfaction semantics; strategy complexity; time polynomial; Complexity theory; Joints; Markov processes; Optimization; Polynomials; Probabilistic logic; Semantics; Markov decision processes; limit average reward; mean payoff;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Logic in Computer Science (LICS), 2015 30th Annual ACM/IEEE Symposium on

Conference_Location :

Kyoto

ISSN :

1043-6871

Type :

conf

DOI :

10.1109/LICS.2015.32

Filename :

7174886

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=728976