An Adaptive Approach for the Exploration-Exploitation Dilemma in Non-stationary Environment

Author

SHEN, Yuanxia ; Zeng, Chuanhua

Author_Institution

Dept. of Math. & Comput. Sci., Chongqing Univ. of Arts & Sci., Chongqing

Volume

1

fYear

2008

fDate

12-14 Dec. 2008

Firstpage

497

Lastpage

500

Abstract

A central problem in reinforcement learning is balancing exploration-exploitation dilemma in non-stationary environment. To address this problem, a data-driven Q-learning is presented. In this study, firstly, the information system of behavior is formed by experience of agent. Then the trigger mechanism of environment is constructed to trace changes of environment by uncertain knowledge of information system. The dynamic information of environment is used to balance exploration-exploitation dilemma with self-driven way. We illustrated this algorithm with grid-world navigation tasks. The results of simulated experiments show that this algorithm improves learning efficiency obviously.

Keywords

learning (artificial intelligence); data-driven Q-learning; exploration-exploitation dilemma; information system; nonstationary environment; reinforcement learning; Art; Computational modeling; Computer science; Industrial control; Information systems; Iterative algorithms; Learning; Mathematics; Navigation; Software engineering; Q-learning; data-driven; reinforcement learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Science and Software Engineering, 2008 International Conference on

Conference_Location

Wuhan, Hubei

Print_ISBN

978-0-7695-3336-0

Type

conf

DOI

10.1109/CSSE.2008.677

Filename

4721795