• DocumentCode
    2827563
  • Title

    Adaptive critic design in learning to play game of Go

  • Author

    Zaman, Raonak ; Prokhorov, Danil ; Wunsch, Donald C., II

  • Author_Institution
    Dept. of Electr. Eng., Texas Tech. Univ., Lubbock, TX, USA
  • Volume
    1
  • fYear
    1997
  • fDate
    9-12 Jun 1997
  • Firstpage
    1
  • Abstract
    This paper examines the performance of an HDP-type adaptive critic design (ACD) of the game Go. The game Go is an ideal problem domain for exploring machine learning; it has simple rules but requires complex strategies to play well. All current commercial Go programs are knowledge based implementations; they utilize input feature and pattern matching along with minimax type search techniques. But the extremely high branching factor puts a limit on their capabilities, and they are very weak compared to the relative strengths of other game programs like chess. In this paper, the Go-playing ACD consists of a critic network and an action network. The HDP type critic network learns to predict the cumulative utility function of the current board position from training games, and, the action network chooses a next move which maximizes critics next step cost-to-go. After about 6000 different training games against a public domain program, WALLY, the network (playing WHITE) began to win in some of the games and showed slow but steady improvements on test games
  • Keywords
    games of skill; learning (artificial intelligence); neural net architecture; action network; adaptive critic design; critic network; cumulative utility function; game of Go; learning; Computational intelligence; Humans; Laboratories; Law; Legal factors; Machine learning; Minimax techniques; Pattern matching; Prediction algorithms; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks,1997., International Conference on
  • Conference_Location
    Houston, TX
  • Print_ISBN
    0-7803-4122-8
  • Type

    conf

  • DOI
    10.1109/ICNN.1997.611623
  • Filename
    611623