• DocumentCode
    2446829
  • Title

    Estimating learning rates in evolution and TDL: Results on a simple grid-world problem

  • Author

    Lucas, Simon M.

  • Author_Institution
    Sch. of Comput. Sci. & Electron. Eng., Univ. of Essex, Colchester, UK
  • fYear
    2010
  • fDate
    18-21 Aug. 2010
  • Firstpage
    372
  • Lastpage
    379
  • Abstract
    When learning to play a game or perform some task, it is important to learn as quickly and effectively as possible by making best use of the available information. Interesting insights can be gained by studying the learning process from an information theory perspective, and analysing the learning speed in terms of the maximum number of bits that could be learned per game/task, or per action. Previous work has applied this analysis to co-evolution and to temporal difference learning (TDL) for a simple board game with a fixed number of moves. This paper analyses a grid-world problem and calculates the upper bounds on the information rates for evolution and for TDL. The results show an interesting relationship between the upper bounds of the learning rates and the actual information acquisition rates that are achieved in practice. Also, which method works best is highly dependent on the choice of function approximator.
  • Keywords
    computer games; evolutionary computation; function approximation; knowledge acquisition; learning (artificial intelligence); TDL; board game; evolutionary algorithm; function approximator; grid world problem; information acquisition rate; information theory; temporal difference learning; Equations; Error probability; Evolutionary computation; Games; Information rates; Mathematical model; Upper bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Games (CIG), 2010 IEEE Symposium on
  • Conference_Location
    Dublin
  • Print_ISBN
    978-1-4244-6295-7
  • Electronic_ISBN
    978-1-4244-6296-4
  • Type

    conf

  • DOI
    10.1109/ITW.2010.5593332
  • Filename
    5593332