• DocumentCode
    479768
  • Title

    An Adaptive Approach for the Exploration-Exploitation Dilemma in Non-stationary Environment

  • Author

    SHEN, Yuanxia ; Zeng, Chuanhua

  • Author_Institution
    Dept. of Math. & Comput. Sci., Chongqing Univ. of Arts & Sci., Chongqing
  • Volume
    1
  • fYear
    2008
  • fDate
    12-14 Dec. 2008
  • Firstpage
    497
  • Lastpage
    500
  • Abstract
    A central problem in reinforcement learning is balancing exploration-exploitation dilemma in non-stationary environment. To address this problem, a data-driven Q-learning is presented. In this study, firstly, the information system of behavior is formed by experience of agent. Then the trigger mechanism of environment is constructed to trace changes of environment by uncertain knowledge of information system. The dynamic information of environment is used to balance exploration-exploitation dilemma with self-driven way. We illustrated this algorithm with grid-world navigation tasks. The results of simulated experiments show that this algorithm improves learning efficiency obviously.
  • Keywords
    learning (artificial intelligence); data-driven Q-learning; exploration-exploitation dilemma; information system; nonstationary environment; reinforcement learning; Art; Computational modeling; Computer science; Industrial control; Information systems; Iterative algorithms; Learning; Mathematics; Navigation; Software engineering; Q-learning; data-driven; reinforcement learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Software Engineering, 2008 International Conference on
  • Conference_Location
    Wuhan, Hubei
  • Print_ISBN
    978-0-7695-3336-0
  • Type

    conf

  • DOI
    10.1109/CSSE.2008.677
  • Filename
    4721795