مرکز منطقه ای اطلاع رساني علوم و فناوري - On expediency and convergence in variable structure automata

DocumentCode :

2991023

Title :

On expediency and convergence in variable structure automata

Author :

Chandrasekaran, B. ; C. Shen, D.

Author_Institution :

University of Pennsylvania, Philadelphia, Pa.

fYear :

1966

fDate :

3-3 Oct. 1966

Firstpage :

845

Lastpage :

850

Abstract :

A variable structure stochastic automaton responds to the penalties from a random environment by changing its state probability distribution through a reinforcement scheme. This paper discusses the efficiency of learning for a 2-state automaton in terms of expediency and convergence under two types of nonlinear reinforcement schemes, one based on penalty probabilities and the other on penalty strengths. The stability of the asymptotic expected values of the state probability is discussed in detail. The conditions for achieving optimal, and expedient behavior of the automaton are derived. Convergence is discussed in the light of variance analysis. Learning curves can be obtained by solving nonlinear difference equations relating successive expected values, while for the linear case an analytic expression is derived. Finally transformation of penalty strengths to improve asymptotic state probability separation is considered.

Keywords :

Adaptive control; Automatic control; Convergence; Learning automata; Programmable control; Stochastic processes;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Adaptive Processes, 1966. Fifth Symposium on

Conference_Location :

USA

Type :

conf

DOI :

10.1109/SAP.1966.271166

Filename :

4043693

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2991023