• DocumentCode
    352729
  • Title

    Genetic TD(λ) learning algorithm for policy evaluation problems

  • Author

    Xin, Xu ; Han-gen, He

  • Author_Institution
    Dept. of Autom. Control, Nat. Univ. of Defense Technol., Changsha, China
  • Volume
    1
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    500
  • Abstract
    In this paper, tabular TD(λ) learning algorithm is combined with genetic algorithm (GA) to solve stochastic policy evaluation problems. Unlike conventional TD(λ) algorithm which has fixed control parameters, the proposed genetic TD(λ) algorithm makes use of GA to optimize the control parameters while evaluating stochastic policies. Simulated experiments on stochastic policy evaluation problems show that genetic TD(λ) algorithms not only realize the auto-tuning of control parameters but also have improved performance
  • Keywords
    genetic algorithms; learning (artificial intelligence); learning systems; auto-tuning; genetic algorithm; optimisation; reinforcement learning; stochastic policy evaluation; Algorithm design and analysis; Automatic control; Computer science; Convergence; Electronic mail; Genetic algorithms; Helium; Machine learning; Machine learning algorithms; Stochastic processes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Control and Automation, 2000. Proceedings of the 3rd World Congress on
  • Conference_Location
    Hefei
  • Print_ISBN
    0-7803-5995-X
  • Type

    conf

  • DOI
    10.1109/WCICA.2000.860017
  • Filename
    860017