DocumentCode
479768
Title
An Adaptive Approach for the Exploration-Exploitation Dilemma in Non-stationary Environment
Author
SHEN, Yuanxia ; Zeng, Chuanhua
Author_Institution
Dept. of Math. & Comput. Sci., Chongqing Univ. of Arts & Sci., Chongqing
Volume
1
fYear
2008
fDate
12-14 Dec. 2008
Firstpage
497
Lastpage
500
Abstract
A central problem in reinforcement learning is balancing exploration-exploitation dilemma in non-stationary environment. To address this problem, a data-driven Q-learning is presented. In this study, firstly, the information system of behavior is formed by experience of agent. Then the trigger mechanism of environment is constructed to trace changes of environment by uncertain knowledge of information system. The dynamic information of environment is used to balance exploration-exploitation dilemma with self-driven way. We illustrated this algorithm with grid-world navigation tasks. The results of simulated experiments show that this algorithm improves learning efficiency obviously.
Keywords
learning (artificial intelligence); data-driven Q-learning; exploration-exploitation dilemma; information system; nonstationary environment; reinforcement learning; Art; Computational modeling; Computer science; Industrial control; Information systems; Iterative algorithms; Learning; Mathematics; Navigation; Software engineering; Q-learning; data-driven; reinforcement learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Software Engineering, 2008 International Conference on
Conference_Location
Wuhan, Hubei
Print_ISBN
978-0-7695-3336-0
Type
conf
DOI
10.1109/CSSE.2008.677
Filename
4721795
Link To Document