مرکز منطقه ای اطلاع رساني علوم و فناوري - On the convergence and ODE limit of a two-dimensional stochastic approximation

DocumentCode :

1126228

Title :

On the convergence and ODE limit of a two-dimensional stochastic approximation

Author :

Ma, Dye-Jyun ; Makowski, Armand M.

Author_Institution :

Nat. Chang-Hsing Univ., Taichung, Taiwan

Volume :

Issue :

fYear :

1994

fDate :

7/1/1994 12:00:00 AM

Firstpage :

1439

Lastpage :

1442

Abstract :

We consider a two-dimensional stochastic approximations scheme of the Robbins-Monro type which naturally arises in the study of steering policies for Markov decision processes. Making use of a decoupling change of variables, we establish its almost sure convergence by ad-hoc arguments that combine standard results on one-dimensional stochastic approximations with a version of the law of large numbers for martingale differences. We use this direct analysis to guide us in selecting the test function which appears in standard convergence results for multidimensional schemes. Furthermore, although a blind application of the ordinary differential equation (ODE) method is not possible here due to a lack of regularity properties, the aforementioned change of variables paves the way for an interpretation of the behavior of solutions to the associated limiting ODE

Keywords :

Markov processes; approximation theory; convergence of numerical methods; decision theory; differential equations; 2D stochastic approximation; Markov decision processes; Robbins-Monro type; convergence; martingale differences; multidimensional schemes; ordinary differential equation; Artificial intelligence; Automatic control; Controllability; Convergence; Eigenvalues and eigenfunctions; Linear systems; Multidimensional systems; Polynomials; Stochastic processes; Testing;

fLanguage :

English

Journal_Title :

Automatic Control, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9286

Type :

jour

DOI :

10.1109/9.299629

Filename :

299629

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1126228