Title :
Competitive reinforcement learning for combinatorial problems
Author :
Abramson, Myriam ; Wechsler, Harry
Author_Institution :
George Mason Univ., Fairfax, VA, USA
Abstract :
This paper shows that the competitive learning rule found in learning vector quantization (LVQ) serves as a promising function approximator to enable reinforcement learning methods to cope with a large decision search space, defined in terms of difference classes of input patterns, like those found in the game of Go. This paper describes S[arsa]LVQ, a novel reinforcement learning algorithm and shows its feasibility for Go. As the distributed LVQ representation corresponds to a (quantized) codebook of compressed and generalized pattern templates, the state space requirements for online reinforcement methods are significantly reduced, thus decreasing the complexity of the decision space and consequently improving the play performance. As a result of competitive learning, SLVQ can win against heuristic players and starts to level off against stronger opponents such as Wally. SLVQ outperforms S[arsa]Linear when playing against both a heuristic player and Wally. Furthermore, while playing Go, SLVQ learns to stay alive while SLinear fails to do so
Keywords :
function approximation; games of skill; search problems; self-organising feature maps; unsupervised learning; vector quantisation; GO game; codebook; competitive learning; data compression; function approximation; learning vector quantization; reinforcement learning; search space; self organisation; Delay; Joining processes; Machine learning; Shape; State-space methods; Vector quantization;
Conference_Titel :
Neural Networks, 2001. Proceedings. IJCNN '01. International Joint Conference on
Conference_Location :
Washington, DC
Print_ISBN :
0-7803-7044-9
DOI :
10.1109/IJCNN.2001.938727