Temporal difference learning of N-tuple networks for the game 2048

Author

Szubert, Marcin ; Jaskowski, Wojciech

Author_Institution

Inst. of Comput. Sci., Poznan Univ. of Technol., Poznan, Poland

fYear

2014

fDate

26-29 Aug. 2014

Firstpage

1

Lastpage

8

Abstract

The highly addictive stochastic puzzle game 2048 has recently invaded the Internet and mobile devices, stealing countless hours of players´ lives. In this study we investigate the possibility of creating a game-playing agent capable of winning this game without incorporating human expertise or performing game tree search. For this purpose, we employ three variants of temporal difference learning to acquire i) action value, ii) state value, and iii) afterstate value functions for evaluating player moves at 1-ply. To represent these functions we adopt n-tuple networks, which have recently been successfully applied to Othello and Connect 4. The conducted experiments demonstrate that the learning algorithm using afterstate value functions is able to consistently produce players winning over 97% of games. These results show that n-tuple networks combined with an appropriate learning algorithm have large potential, which could be exploited in other board games.

Keywords

Internet; computer games; learning (artificial intelligence); mobile computing; software agents; stochastic games; tree searching; Connect 4; I-ply; Internet; Othello; afterstate value functions; board games; game tree search; game-playing agent; human expertise; learning algorithm; mobile devices; n-tuple networks; player move evaluation; state value; stochastic puzzle game 2048; temporal difference learning; Games; game 2048; n-tuple networks; position evaluation function; reinforcement learning; temporal difference learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Computational Intelligence and Games (CIG), 2014 IEEE Conference on

Conference_Location

Dortmund

Type

conf

DOI

10.1109/CIG.2014.6932907

Filename

6932907