DocumentCode :
833160
Title :
Combination of online clustering and Q-value based GA for reinforcement fuzzy system design
Author :
Juang, Chia-Feng
Author_Institution :
Dept. of Electr. Eng., Nat. Chung Hsing Univ., Taichung, Taiwan
Volume :
13
Issue :
3
fYear :
2005
fDate :
6/1/2005 12:00:00 AM
Firstpage :
289
Lastpage :
302
Abstract :
This paper proposes a combination of online clustering and Q-value based genetic algorithm (GA) learning scheme for fuzzy system design (CQGAF) with reinforcements. The CQGAF fulfills GA-based fuzzy system design under reinforcement learning environment where only weak reinforcement signals such as "success" and "failure" are available. In CQGAF, there are no fuzzy rules initially. They are generated automatically. The precondition part of a fuzzy system is online constructed by an aligned clustering-based approach. By this clustering, a flexible partition is achieved. Then, the consequent part is designed by Q-value based genetic reinforcement learning. Each individual in the GA population encodes the consequent part parameters of a fuzzy system and is associated with a Q-value. The Q-value estimates the discounted cumulative reinforcement information performed by the individual and is used as a fitness value for GA evolution. At each time step, an individual is selected according to the Q-values, and then a corresponding fuzzy system is built and applied to the environment with a critic received. With this critic, Q-learning with eligibility trace is executed. After each trial, GA is performed to search for better consequent parameters based on the learned Q-values. Thus, in CQGAF, evolution is performed immediately after the end of one trial in contrast to general GA where many trials are performed before evolution. The feasibility of CQGAF is demonstrated through simulations in cart-pole balancing, magnetic levitation, and chaotic system control problems with only binary reinforcement signals.
Keywords :
fuzzy systems; genetic algorithms; learning (artificial intelligence); Q-value based genetic algorithms; cart-pole balancing; chaotic system control; magnetic levitation; online clustering; reinforcement fuzzy system design; reinforcement learning; Algorithm design and analysis; Chaos; Fuzzy control; Fuzzy logic; Fuzzy neural networks; Fuzzy systems; Genetic algorithms; Magnetic levitation; Signal design; Supervised learning; Flexible partition; Q-learning; fuzzy control; reinforcement learning; temporal difference;
fLanguage :
English
Journal_Title :
Fuzzy Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6706
Type :
jour
DOI :
10.1109/TFUZZ.2004.841726
Filename :
1439517
Link To Document :
بازگشت