مرکز منطقه ای اطلاع رساني علوم و فناوري - Maximizing the set of recurrent states of an MDP subject to convex constraints

Title of article :

Maximizing the set of recurrent states of an MDP subject to convex constraints

Author/Authors :

R. Arvelo، نويسنده , , Eduardo and Martins، نويسنده , , Nuno C. Santos، نويسنده ,

Issue Information :

روزنامه با شماره پیاپی سال 2014

Pages :

From page :

994

To page :

998

Abstract :

This paper focuses on the design of time-homogeneous fully observed Markov decision processes (MDPs), with finite state and action spaces. The main objective is to obtain policies that generate the maximal set of recurrent states, subject to convex constraints on the set of invariant probability mass functions. We propose a design method that relies on a finitely parametrized convex program inspired on principles of entropy maximization. A numerical example is provided to illustrate these ideas.

Keywords :

Maximum Entropy , Markov decision problems , Markov models , optimal control , Convex optimization

Journal title :

Automatica

Serial Year :

2014

Journal title :

Automatica

Record number :

1449717

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=1449717