مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

3121831

Title :

Self-modifying reinforcement learning

Author :

Zhao, Jie-Yu

Author_Institution :

IST Res., Ningbo Univ., Zhejiang, China

Volume :

fYear :

2002

fDate :

4-5 Nov. 2002

Firstpage :

2146

Abstract :

We describe several experiments with reinforcement learning systems based on the technique of incremental self-improvement (IS). IS uses the success-story algorithm (SSA) to undo unrewarding policy changes computed by self-modifying policies. The experiment demonstrates IS´ advantages over stochastic hill climbing and TD Q-learning in noisy environments given limited computational resources.

Keywords :

learning (artificial intelligence); learning automata; stochastic automata; TD Q-learning; incremental self-improvement; noisy environments; self-modifying reinforcement learning; stochastic hill climbing; success-story algorithm; Acceleration; Genetic algorithms; Learning; Monitoring; Noise measurement; Performance evaluation; Stochastic processes; Testing; Time measurement; Working environment noise;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on

Print_ISBN :

0-7803-7508-4

Type :

conf

DOI :

10.1109/ICMLC.2002.1175418

Filename :

1175418

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3121831