DocumentCode :
2610402
Title :
Control Double Inverted Pendulum by Reinforcement Learning with Double CMAC Network
Author :
Zheng, Yu ; Luo, Siwei ; Lv, Ziang
Author_Institution :
Sch. of Comput. & Inf. Technol., Beijing Jiaotong Univ.
Volume :
4
fYear :
0
fDate :
0-0 0
Firstpage :
639
Lastpage :
642
Abstract :
To accelerate the learning of reinforcement learning, many types of function approximation are used to represent state value. However function approximation reduces the accuracy of state value, and brings difficulty in the convergence. To solve the problems of tradeoff between the generalization and accuracy in reinforcement learning, we represent state-action value by two CMAC networks with different generalization parameters. The accuracy CMAC network can represent values exactly, which achieves precise control in the states around target area. And the generalization CMAC network can extend experiences to unknown area, and guide the learning of accuracy CMAC network. The algorithm proposed in this paper can effectively avoid the dilemma of achieving tradeoff between generalization and accuracy. Simulation results for the control of double inverted pendulum are presented to show effectiveness of the proposed algorithm
Keywords :
cerebellar model arithmetic computers; generalisation (artificial intelligence); learning (artificial intelligence); neurocontrollers; nonlinear control systems; pendulums; double CMAC network; double inverted pendulum control; function approximation; generalization; reinforcement learning; state-action value; Acceleration; Approximation algorithms; Computer networks; Control systems; Convergence; Function approximation; Iterative algorithms; Learning; Optimal control; State-space methods;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location :
Hong Kong
ISSN :
1051-4651
Print_ISBN :
0-7695-2521-0
Type :
conf
DOI :
10.1109/ICPR.2006.416
Filename :
1699922
Link To Document :
بازگشت