مرکز منطقه ای اطلاع رساني علوم و فناوري - A reinforcement learning neural network for adaptive control of Markov chains

DocumentCode :

1492230

Title :

A reinforcement learning neural network for adaptive control of Markov chains

Author :

Santharam, G. ; Sastry, P.S.

Author_Institution :

Motorola India Electron., Bangalore, India

Volume :

Issue :

fYear :

1997

fDate :

9/1/1997 12:00:00 AM

Firstpage :

588

Lastpage :

600

Abstract :

In this paper we consider the problem of reinforcement learning in a dynamically changing environment. In this context, we study the problem of adaptive control of finite-state Markov chains with a finite number of controls. The transition and payoff structures are unknown. The objective is to find an optimal policy which maximizes the expected total discounted payoff over the infinite horizon. A stochastic neural network model is suggested for the controller. The parameters of the neural net, which determine a random control strategy, are updated at each instant using a simple learning scheme. This learning scheme involves estimation of some relevant parameters using an adaptive critic. It is proved that the controller asymptotically chooses an optimal action in each state of the Markov chain with a high probability

Keywords :

Markov processes; adaptive systems; learning (artificial intelligence); learning automata; neural nets; probabilistic automata; probability; Markov chains; adaptive control; adaptive critic; expected total discounted payoff; learning automata; probability; reinforcement learning; stochastic neural network; Adaptive control; Adaptive systems; Biological system modeling; Infinite horizon; Learning automata; Learning systems; Neural networks; Optimal control; Pattern classification; Stochastic processes;

fLanguage :

English

Journal_Title :

Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on

Publisher :

ieee

ISSN :

1083-4427

Type :

jour

DOI :

10.1109/3468.618258

Filename :

618258

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1492230