مرکز منطقه ای اطلاع رساني علوم و فناوري - Suggestion of probabilistic reward-independent knowledge for dynamic environment in reinforcement learning

DocumentCode :

2631492

Title :

Suggestion of probabilistic reward-independent knowledge for dynamic environment in reinforcement learning

Author :

Shibuya, Nodoka ; Miyazaki, Yoshiki ; Kurashige, Kentarou

Author_Institution :

Dept. of Inf. & Electron., Muroran Inst. of Technol., Muroran, Japan

fYear :

2011

fDate :

6-9 Nov. 2011

Firstpage :

140

Lastpage :

145

Abstract :

Recently, reinforcement learning attracts attention as the learning technique that is often used on actual robot. As one of problems of reinforcement learning, it is difficult for reinforcement learning to cope with changing purpose, because reinforcement learning depend on reward. Until now, we suggested that we learned to use information does not depend on reward for solving the problem. This information is environmental transition. We defined this information as “Reward-Independent Knowledge (RIK)”. A robot gets RIK and predicts route from initial state to purpose state by using RIK. Reinforcement learning can cope with changing purpose by using RIK. However, it is difficult for RIK to cope with dynamic environment, because RIK is one to one correspondence between state-action pair and next state. Therefore, we suggest that RIK has multiple next state and probability of each possible next state. In this paper, we perform an experiment by simulation. We show that suggested knowledge copes with changing purpose and dynamic environment. In this experiment, we adopt a maze problem which a goal change and changing structure of maze. By this, we will show that suggested knowledge can cope with changing purpose and dynamic environment.

Keywords :

learning (artificial intelligence); robots; actual robot; dynamic environment; environmental transition; learning technique; maze problem; probabilistic reward-independent knowledge; reinforcement learning; Equations; Humans; Learning; Mathematical model; Probabilistic logic; Probability; Robots;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Micro-NanoMechatronics and Human Science (MHS), 2011 International Symposium on

Conference_Location :

Nagoya

ISSN :

Pending

Print_ISBN :

978-1-4577-1360-6

Type :

conf

DOI :

10.1109/MHS.2011.6102175

Filename :

6102175

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2631492