DocumentCode :
295134
Title :
Weighted Markov decision processes with perturbation
Author :
Liu, Ke ; Filar, Jerzy A.
Author_Institution :
Sch. of Math., Univ. of South Australia, Adelaide, SA, Australia
Volume :
3
fYear :
1995
fDate :
13-15 Dec 1995
Firstpage :
2269
Abstract :
Considers the weighted reward MDPs with perturbation. The authors give the proof of existence of a δ-optimal simple ultimately deterministic policy under the assumption of “scalar value”. The authors also prove that there exists a δ-i-optimal simple ultimately deterministic policy in the perturbed weighted MDP, for all ε ∈[0,ε´] even without the assumption of “scalar value”
Keywords :
Markov processes; decision theory; δ-i-optimal simple ultimately deterministic policy; δ-optimal simple ultimately deterministic policy; weighted Markov decision processes; Australia; History; Mathematics; Radio frequency; Random variables; State-space methods; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control, 1995., Proceedings of the 34th IEEE Conference on
Conference_Location :
New Orleans, LA
ISSN :
0191-2216
Print_ISBN :
0-7803-2685-7
Type :
conf
DOI :
10.1109/CDC.1995.480541
Filename :
480541
Link To Document :
بازگشت