Title : 
A sensitivity-based construction approach to sample-path variance minimization of Markov decision processes
         
        
            Author : 
Yonghao Huang ; Xi Chen
         
        
            Author_Institution : 
Dept. of Autom., Tsinghua Univ., Beijing, China
         
        
        
        
        
        
            Abstract : 
We study the limiting average variance along the sample path as the secondary criterion for Markov decision processes, with the long-run average performance as the primary criterion. By applying the sensitivity-based approach, we intuitively construct the difference formula for the sample-path variance under different policies. Thereby, a sufficient condition for the sample-path variance optimality can be easily derived. This work extends the sensitivity-based construction approach to the Markov decision processes with the nonstandard performance criterion. Compared with the pure mathematical verification, the sensitivity-based construction approach shows more intuition and provides insights on the sample-path structure of Markov decision processes.
         
        
            Keywords : 
Markov processes; decision making; minimisation; Markov decision processes; limiting average variance; long-run average performance; nonstandard performance criterion; pure mathematical verification; sample-path variance minimization; sample-path variance optimality; sensitivity-based construction approach; Australia; Limiting; Markov processes; Minimization; Optimization; Poisson equations; Vectors;
         
        
        
        
            Conference_Titel : 
Control Conference (AUCC), 2012 2nd Australian
         
        
            Conference_Location : 
Sydney, NSW
         
        
            Print_ISBN : 
978-1-922107-63-3