مرکز منطقه ای اطلاع رساني علوم و فناوري - A connectionist actor-critic algorithm for faster learning and biological plausibility

DocumentCode :

250731

Title :

A connectionist actor-critic algorithm for faster learning and biological plausibility

Author :

Johard, Leonard ; Ruffaldi, Emanuele

Author_Institution :

Scuola Superiore S. Anna, PERCRO, Pisa, Italy

fYear :

2014

fDate :

May 31 2014-June 7 2014

Firstpage :

3903

Lastpage :

3909

Abstract :

We propose a novel biologically plausible actor-critic algorithm using policy gradients in order to achieve practical, model-free reinforcement learning. It does not rely on backpropagation and is the first neural actor-critic relying only on locally available information. We show it has an advantage over pure policy gradients methods for motor learning performance in the polecart problem. We are also able to closely simulate the dopaminergic signaling patterns in rats when confronted with a two cue problem, showing that local, connectionist models can effectively model the functioning of the intrinsic reward system.

Keywords :

biology computing; gradient methods; learning (artificial intelligence); neural nets; biologically plausible actor-critic algorithm; connectionist actor-critic algorithm; dopaminergic signaling patterns; intrinsic reward system; model-free reinforcement learning; neural actor-critic; polecart problem; policy gradients; Backpropagation; Biological system modeling; Learning (artificial intelligence); Neurons; Supervised learning; Training;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Robotics and Automation (ICRA), 2014 IEEE International Conference on

Conference_Location :

Hong Kong

Type :

conf

DOI :

10.1109/ICRA.2014.6907425

Filename :

6907425

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=250731