Title :
Weighted Markov decision processes with perturbation
Author :
Liu, Ke ; Filar, Jerzy A.
Author_Institution :
Sch. of Math., Univ. of South Australia, Adelaide, SA, Australia
Abstract :
Considers the weighted reward MDPs with perturbation. The authors give the proof of existence of a δ-optimal simple ultimately deterministic policy under the assumption of “scalar value”. The authors also prove that there exists a δ-i-optimal simple ultimately deterministic policy in the perturbed weighted MDP, for all ε ∈[0,ε´] even without the assumption of “scalar value”
Keywords :
Markov processes; decision theory; δ-i-optimal simple ultimately deterministic policy; δ-optimal simple ultimately deterministic policy; weighted Markov decision processes; Australia; History; Mathematics; Radio frequency; Random variables; State-space methods; Statistics;
Conference_Titel :
Decision and Control, 1995., Proceedings of the 34th IEEE Conference on
Conference_Location :
New Orleans, LA
Print_ISBN :
0-7803-2685-7
DOI :
10.1109/CDC.1995.480541