DocumentCode :
1101548
Title :
Partially Observed Stochastic Shortest Path Problems With Approximate Solution by Neurodynamic Programming
Author :
Patek, Stephen D.
Author_Institution :
Virginia Univ., Charlottesville
Volume :
37
Issue :
5
fYear :
2007
Firstpage :
710
Lastpage :
720
Abstract :
We analyze a class of Markov decision processes with imperfect state information that evolve on an infinite time horizon and have a total cost criterion. In particular, we are interested in problems with stochastic shortest path structure, assuming the following: 1) the existence of a policy that guarantees termination with probability one and 2) the property that any policy that fails to guarantee termination has infinite expected cost from some initial state. We also assume that termination is perfectly recognized. In this paper, we clarify and expand upon arguments (given in an earlier paper) for establishing the existence, uniqueness, and characterization of stationary optimal policies, and the convergence of value and policy iteration. We also present an illustrative example, involving the search for a partially observed target that moves randomly on a grid, and we develop a simulation-based algorithm (based on neurodynamic programming techniques) for computing policies that approximately minimize the expected number of stages to complete the search.
Keywords :
Markov processes; approximation theory; dynamic programming; Markov decision processes; approximate solution; infinite time horizon; neurodynamic programming; partially observed stochastic shortest path problems; simulation-based algorithm; Computational modeling; Convergence; Costs; Grid computing; Information analysis; Neurodynamics; Programming profession; Shortest path problem; Stochastic processes; Stochastic systems; Markov decision process; neuro-dynamic programming; stochastic shortest path;
fLanguage :
English
Journal_Title :
Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on
Publisher :
IEEE
ISSN :
1083-4427
Type :
jour
DOI :
10.1109/TSMCA.2007.902662
Filename :
4292229