Title :
A sensitivity-based construction approach to sample-path variance minimization of Markov decision processes
Author :
Yonghao Huang ; Xi Chen
Author_Institution :
Dept. of Autom., Tsinghua Univ., Beijing, China
Abstract :
We study the limiting average variance along the sample path as the secondary criterion for Markov decision processes, with the long-run average performance as the primary criterion. By applying the sensitivity-based approach, we intuitively construct the difference formula for the sample-path variance under different policies. Thereby, a sufficient condition for the sample-path variance optimality can be easily derived. This work extends the sensitivity-based construction approach to the Markov decision processes with the nonstandard performance criterion. Compared with the pure mathematical verification, the sensitivity-based construction approach shows more intuition and provides insights on the sample-path structure of Markov decision processes.
Keywords :
Markov processes; decision making; minimisation; Markov decision processes; limiting average variance; long-run average performance; nonstandard performance criterion; pure mathematical verification; sample-path variance minimization; sample-path variance optimality; sensitivity-based construction approach; Australia; Limiting; Markov processes; Minimization; Optimization; Poisson equations; Vectors;
Conference_Titel :
Control Conference (AUCC), 2012 2nd Australian
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-922107-63-3