مرکز منطقه ای اطلاع رساني علوم و فناوري - Partially Observable Markov Decision Processes and Performance Sensitivity Analysis

DocumentCode :

863805

Title :

Partially Observable Markov Decision Processes and Performance Sensitivity Analysis

Author :

Li, Yanjie ; Yin, Baoqun ; Xi, Hongsheng

Author_Institution :

Dept. of Autom., Univ. of Sci. & Technol. of China, Hefei

Volume :

Issue :

fYear :

2008

Firstpage :

1645

Lastpage :

1651

Abstract :

The sensitivity-based optimization of Markov systems has become an increasingly important area. From the perspective of performance sensitivity analysis, policy-iteration algorithms and gradient estimation methods can be directly obtained for Markov decision processes (MDPs). In this correspondence, the sensitivity-based optimization is extended to average reward partially observable MDPs (POMDPs). We derive the performance-difference and performance-derivative formulas of POMDPs. On the basis of the performance-derivative formula, we present a new method to estimate the performance gradients. From the performance-difference formula, we obtain a sufficient optimality condition without the discounted reward formulation. We also propose a policy-iteration algorithm to obtain a nearly optimal finite-state-controller policy.

Keywords :

Markov processes; estimation theory; gradient methods; optimisation; sensitivity analysis; MDP; Markov systems; POMDP; gradient estimation methods; optimal finite-state-controller policy; partially observable Markov decision processes; performance sensitivity analysis; performance-derivative formula; performance-difference formula; policy-iteration algorithms; sensitivity-based optimization; Finite-state controller (FSC); gradient estimation; partially observable Markov decision processes (POMDPs); policy iteration; sensitivity analysis; Algorithms; Artificial Intelligence; Computer Simulation; Decision Making; Decision Support Techniques; Markov Chains; Models, Statistical; Pattern Recognition, Automated; Sensitivity and Specificity;

fLanguage :

English

Journal_Title :

Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on

Publisher :

ieee

ISSN :

1083-4419

Type :

jour

DOI :

10.1109/TSMCB.2008.927711

Filename :

4625978

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=863805