مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

1836983

Title :

Dopamine and inference about timing

Author :

Daw, Nathaniel D. ; Courville, Aaron C. ; Touretzky, David S.

fYear :

2002

fDate :

2002

Firstpage :

271

Lastpage :

276

Abstract :

Several investigators have suggested that the primate dopamine system carries an error signal for learning to predict future rewards. These models, based on temporal-difference (TD) learning, explain most phasic responses of primate dopamine neurons in appetitive conditioning; moreover, they suggest a neurophysiological account of animal conditioning behavior. But because existing models are based in the simple formal setting of Markov processes, they are deficient in at least two areas relevant to physiological and behavioral data. They do not provide a realistic account of the partial observability of the state of the world, nor of how the system tracks the timing of events. In this paper, we introduce a version of TD learning grounded in a richer formal model to better address both issues and, consequently, to explain some data that challenge existing models.

Keywords :

Markov processes; behavioural sciences; inference mechanisms; learning (artificial intelligence); neural nets; neurophysiology; Markov processes; TD learning; animal conditioning behavior; appetitive conditioning; dopamine; neurophysiology; partial observability; phasic responses; primate dopamine neurons; reward prediction; temporal-difference learning; timing inference; Animals; Ash; Cognitive robotics; Computer science; Intersymbol interference; Markov processes; Neurons; Observability; Predictive models; Timing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Development and Learning, 2002. Proceedings. The 2nd International Conference on

Print_ISBN :

0-7695-1459-6

Type :

conf

DOI :

10.1109/DEVLRN.2002.1011901

Filename :

1011901

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1836983