مرکز منطقه ای اطلاع رساني علوم و فناوري - Sample path optimality for a Markov optimization problem

Title of article :

Sample path optimality for a Markov optimization problem

Author/Authors :

Hunt، نويسنده , , F.Y.، نويسنده ,

Issue Information :

روزنامه با شماره پیاپی سال 2005

Pages :

From page :

769

To page :

779

Abstract :

We study a unichain Markov decision process i.e. a controlled Markov process whose state process under a stationary policy is an ergodic Markov chain. Here the state and action spaces are assumed to be either finite or countable. When the state process is uniformly ergodic and the immediate cost is bounded then a policy that minimizes the long-term expected average cost also has an nth stage sample path cost that with probability one is asymptotically less than the nth stage sample path cost under any other non-optimal stationary policy with a larger expected average cost. This is a strengthening in the Markov model case of the a.s. asymptotically optimal property frequently discussed in the literature.

Keywords :

stochastic control , Azumaיs inequality , Markov decision process

Journal title :

Stochastic Processes and their Applications

Serial Year :

2005

Journal title :

Stochastic Processes and their Applications

Record number :

1577611

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=1577611