DocumentCode :
529339
Title :
Acquisition of deterministic exploration and purposive memory through reinforcement learning with a recurrent neural network
Author :
Goto, Kenta ; Shibata, Katsunari
Author_Institution :
Dept. of Electr. & Electron. Eng., Oita Univ., Oita, Japan
fYear :
2010
fDate :
18-21 Aug. 2010
Firstpage :
1943
Lastpage :
1949
Abstract :
The authors have propounded that various functions emerge purposively and harmoniously through reinforcement learning with a neural network. In this paper, emergence of deterministic ”exploration” behavior, which is different from the stochastic exploration and needs higher intelligence, is focused on. In order to realize the intelligent exploration behaviors, it becomes a key point whether the recurrent neural network memorizes necessary information and utilizes it to generate appropriate actions. In the simulation of 3 × 3 grid world with an invisible goal task, by introducing a recurrent neural network for Q-learning, an agent can represent more accurate Q-values considering the past experiences, and that is suggested to enable to learn appropriate actions. The acquired knowledge can be generalized in some unknown environment to some extent. In another task in a simple environment with a random-located branch, it is also shown that the recurrent neural network cleverly memorizes and keeps the branch position to represent accurate Q-values after learning.
Keywords :
knowledge acquisition; learning (artificial intelligence); recurrent neural nets; storage management; Q-learning; deterministic exploration acquisition; purposive memory; random located branch; recurrent neural network; reinforcement learning; Artificial neural networks; Learning; Learning systems; Neurons; Recurrent neural networks; Resource management; Robots; deterministic exploration; function emergence; memory; recurrent neural network; reinforcement learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
SICE Annual Conference 2010, Proceedings of
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-7642-8
Type :
conf
Filename :
5602574
Link To Document :
بازگشت