مرکز منطقه ای اطلاع رساني علوم و فناوري - Simple Artificial Neural Networks That Match Probability and Exploit and Explore When Confronting a Multiarmed Bandit

DocumentCode :

1131430

Title :

Simple Artificial Neural Networks That Match Probability and Exploit and Explore When Confronting a Multiarmed Bandit

Author :

Dawson, Michael R W ; Dupuis, Brian ; Spetch, Marcia L. ; Kelly, Debbie M.

Author_Institution :

Dept. of Psychol., Univ. of Alberta, Edmonton, AB, Canada

Volume :

Issue :

fYear :

2009

Firstpage :

1368

Lastpage :

1371

Abstract :

The matching law (Herrnstein 1961) states that response rates become proportional to reinforcement rates; this is related to the empirical phenomenon called probability matching (Vulkan 2000). Here, we show that a simple artificial neural network generates responses consistent with probability matching. This behavior was then used to create an operant procedure for network learning. We use the multiarmed bandit (Gittins 1989), a classic problem of choice behavior, to illustrate that operant training balances exploiting the bandit arm expected to pay off most frequently with exploring other arms. Perceptrons provide a medium for relating results from neural networks, genetic algorithms, animal learning, contingency theory, reinforcement learning, and theories of choice.

Keywords :

learning (artificial intelligence); neural nets; probability; artificial neural networks; matching law; multiarmed bandit; network learning; probability matching; Instrumental learning; multiarmed bandit; operant conditioning; perceptron; probability matching; Animals; Artificial Intelligence; Choice Behavior; Computer Simulation; Conditioning, Operant; Humans; Neural Networks (Computer); Probability; Reinforcement (Psychology); Reinforcement Schedule;

fLanguage :

English

Journal_Title :

Neural Networks, IEEE Transactions on

Publisher :

ieee

ISSN :

1045-9227

Type :

jour

DOI :

10.1109/TNN.2009.2025588

Filename :

5161343

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1131430