Weighted Markov decision processes with perturbation

Author

Liu, Ke ; Filar, Jerzy A.

Author_Institution

Sch. of Math., Univ. of South Australia, Adelaide, SA, Australia

Volume

fYear

1995

fDate

13-15 Dec 1995

Firstpage

2269

Abstract

Considers the weighted reward MDPs with perturbation. The authors give the proof of existence of a δ-optimal simple ultimately deterministic policy under the assumption of “scalar value”. The authors also prove that there exists a δ-i-optimal simple ultimately deterministic policy in the perturbed weighted MDP, for all ε ∈[0,ε´] even without the assumption of “scalar value”

Keywords

Markov processes; decision theory; δ-i-optimal simple ultimately deterministic policy; δ-optimal simple ultimately deterministic policy; weighted Markov decision processes; Australia; History; Mathematics; Radio frequency; Random variables; State-space methods; Statistics;

fLanguage

English

Publisher

ieee

Conference_Titel

Decision and Control, 1995., Proceedings of the 34th IEEE Conference on

Conference_Location

New Orleans, LA

ISSN

0191-2216

Print_ISBN

0-7803-2685-7

Type

conf

DOI

10.1109/CDC.1995.480541

Filename

480541

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=295134