DocumentCode :
479768
Title :
An Adaptive Approach for the Exploration-Exploitation Dilemma in Non-stationary Environment
Author :
SHEN, Yuanxia ; Zeng, Chuanhua
Author_Institution :
Dept. of Math. & Comput. Sci., Chongqing Univ. of Arts & Sci., Chongqing
Volume :
1
fYear :
2008
fDate :
12-14 Dec. 2008
Firstpage :
497
Lastpage :
500
Abstract :
A central problem in reinforcement learning is balancing exploration-exploitation dilemma in non-stationary environment. To address this problem, a data-driven Q-learning is presented. In this study, firstly, the information system of behavior is formed by experience of agent. Then the trigger mechanism of environment is constructed to trace changes of environment by uncertain knowledge of information system. The dynamic information of environment is used to balance exploration-exploitation dilemma with self-driven way. We illustrated this algorithm with grid-world navigation tasks. The results of simulated experiments show that this algorithm improves learning efficiency obviously.
Keywords :
learning (artificial intelligence); data-driven Q-learning; exploration-exploitation dilemma; information system; nonstationary environment; reinforcement learning; Art; Computational modeling; Computer science; Industrial control; Information systems; Iterative algorithms; Learning; Mathematics; Navigation; Software engineering; Q-learning; data-driven; reinforcement learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Software Engineering, 2008 International Conference on
Conference_Location :
Wuhan, Hubei
Print_ISBN :
978-0-7695-3336-0
Type :
conf
DOI :
10.1109/CSSE.2008.677
Filename :
4721795
Link To Document :
بازگشت