مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning from positive and negative rewards in a spiking neural network model of basal ganglia

DocumentCode :

2778354

Title :

Learning from positive and negative rewards in a spiking neural network model of basal ganglia

Author :

Jitsev, Jenia ; Morrison, Abigail ; Tittgemeyer, Marc

Author_Institution :

Dept. of Cortical Networks & Cognitive Functions, Max-Planck-Inst. for Neurological Res., Cologne, Germany

fYear :

2012

fDate :

10-15 June 2012

Firstpage :

Lastpage :

Abstract :

Despite the vast amount of experimental findings on the role of the basal ganglia in reinforcement learning, there is still general lack of network models that use spiking neurons and plausible plasticity mechanisms to demonstrate network-level reward-based learning. In this work we extend a recent spiking actor-critic network model of the basal ganglia, aiming to create a minimal realistic model of learning from both positive and negative rewards. We hypothesize and implement in the model segregation of not only the dorsal striatum, but also of the ventral striatum into populations of medium spiny neurons (MSNs) that carry either D1 or D2 dopamine (DA) receptor type. This segregation allows explicit representation of both positive and negative expected reward within respective population. In line with recent experiments, we further assume that D1 and D2 MSN populations have distinct, opposing DA-modulated bidirectional synaptic plasticity. We implement the spiking network model in the simulator NEST and conduct experiments involving application of delayed rewards in a grid world setting, where a moving agent has to reach a goal state while maximizing the total obtained reward. We demonstrate that the network can learn not only to approach the positive rewards, but also to consequently avoid punishments as opposed to the original model. The spiking network model highlights thus functional role of D1-D2 MSN segregation within striatum and explains necessity for reversed direction of DA-dependent plasticity found at synapses converging on different types of striatal MSNs.

Keywords :

neural nets; neurophysiology; DA-modulated bidirectional synaptic plasticity; basal ganglia; dopamine receptor; dorsal striatum; medium spiny neurons; negative reward; network-level reward-based learning; plasticity mechanisms; positive reward; reinforcement learning; spiking actor-critic network model; spiking neural network model; ventral striatum; Basal ganglia; Brain modeling; Computational modeling; Computer architecture; Learning; Mathematical model; Neurons;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks (IJCNN), The 2012 International Joint Conference on

Conference_Location :

Brisbane, QLD

ISSN :

2161-4393

Print_ISBN :

978-1-4673-1488-6

Electronic_ISBN :

2161-4393

Type :

conf

DOI :

10.1109/IJCNN.2012.6252834

Filename :

6252834

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2778354