Title :
A Reinfrocement Learning Approach to Online Learning in Control
Author :
Tehrani, Ali M. ; Kamel, Mohamed S.
Author_Institution :
Pattern Analysis and Machine Intelligence Lab, Systems Design Engineering Department, University of Waterloo, Waterloo, ON, Canada N2L 3G1. atehrani@pami.uwaterloo.ca
Abstract :
In search for a reinforcement learning technique with the simplicity of tabular techniques and capability of dealing with continuous states like in fuzzy systems, we introduce a new approach, which we have called Adaptive Pseudo Fuzzy technique. The idea is to partition the state space with some window functions similar to fuzzy membership functions and learn the output values by changing the rules’ consequents directly. This is a kind of local function approximation similar to RBFs and CMACs, however, unlike RBFs it does not shift the basis, and unlike CMACs it does not go through complex coarse coding. This method represents the input as a fuzzy system and learns the output as a look-up-table. We employed this technique in Sarsa(λ) to build a general online controller and examined it for a pole-balancing problem.
Conference_Titel :
Control and Automation, 2003. ICCA '03. Proceedings. 4th International Conference on
Conference_Location :
Montreal, Que., Canada
Print_ISBN :
0-7803-7777-X
DOI :
10.1109/ICCA.2003.1595047