مرکز منطقه ای اطلاع رساني علوم و فناوري - Complex problem solving with reinforcement learning

DocumentCode :

1861576

Title :

Complex problem solving with reinforcement learning

Author :

Dandurand, Frédéric ; Shultz, Thomas R. ; Rivest, Francois

Author_Institution :

McGill Univ., Montreal

fYear :

2007

fDate :

11-13 July 2007

Firstpage :

157

Lastpage :

162

Abstract :

We previously measured human performance on a complex problem-solving task that involves finding which ball in a set is lighter or heavier than the others with a limited number of weightings. None of the participants found a correct solution within 30 minutes without help of demonstrations or instructions. In this paper, we model human performance on this task using a biologically plausible computational model based on reinforcement learning. We use a SARSA-based Softmax learning algorithm where the reward function is learned using cascade-correlation neural networks. First, we find that the task can be learned by reinforcement alone with substantial training. Second, we study the number of alternative actions available to Softmax and find that 5 works well for this problem which is compatible with estimates of human working memory size. Third, we find that simulations are less accurate than humans given equivalent amount of training We suggest that humans use means-ends analysis to self-generate rewards in non-terminal states. Implementing such self-generated rewards might improve model accuracy. Finally, we pretrain models to prefer simple actions, like humans. We partially capture a simplicity bias, and find that it had little impact on accuracy.

Keywords :

learning (artificial intelligence); neural nets; problem solving; SARSA-based Softmax learning algorithm; biologically plausible computational model; cascade-correlation neural networks; complex problem solving; means-ends analysis; reinforcement learning; substantial training; Biological system modeling; Biology computing; Cognition; Computational modeling; Feedback; Humans; Information processing; Learning; Problem-solving; Psychology; Complex Cognition; Problem Solving; Reinforcement Learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Development and Learning, 2007. ICDL 2007. IEEE 6th International Conference on

Conference_Location :

London

Print_ISBN :

978-1-4244-1116-0

Electronic_ISBN :

978-1-4244-1116-0

Type :

conf

DOI :

10.1109/DEVLRN.2007.4354026

Filename :

4354026

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1861576