DocumentCode :
2403682
Title :
A class of two-dimensional stochastic approximations and steering policies for Markov decision processes
Author :
Ma, Dye-Jynn ; Makowski, Armand M.
Author_Institution :
Digital Equipment Corp., Littleton, MA, USA
fYear :
1992
fDate :
1992
Firstpage :
3344
Abstract :
The authors consider a specific multidimensional stochastic approximation scheme of the Robbins-Monro type that naturally arises in the study of steering policies for Markov decision processes. The usual convergence results (in the almost sure sense) do not seem to apply for this simple scheme. Almost sure convergence is established by an indirect argument that blends standard results on stochastic approximations with a version of the law of large number for martingale differences. These convergence properties provide an alternative proof for some of the properties of steering policies
Keywords :
Markov processes; decision theory; Markov decision processes; almost sure convergence; law of large number; law of large numbers; martingale differences; multidimensional stochastic approximation; steering policies; two-dimensional stochastic approximations; Convergence; Cost function; Extraterrestrial measurements; Kernel; Multidimensional systems; Random variables; Stability; Stochastic processes; Synthetic aperture sonar; Systems engineering and theory; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control, 1992., Proceedings of the 31st IEEE Conference on
Conference_Location :
Tucson, AZ
Print_ISBN :
0-7803-0872-7
Type :
conf
DOI :
10.1109/CDC.1992.371017
Filename :
371017
Link To Document :
بازگشت