• DocumentCode
    31053
  • Title

    General Self-Motivation and Strategy Identification: Case Studies Based on Sokoban and Pac-Man

  • Author

    Anthony, Tiffany ; Polani, Daniel ; Nehaniv, Chrystopher L.

  • Author_Institution
    Adaptive Syst. Res. Group, Univ. of Hertfordshire, Hatfield, UK
  • Volume
    6
  • Issue
    1
  • fYear
    2014
  • fDate
    Mar-14
  • Firstpage
    1
  • Lastpage
    17
  • Abstract
    In this paper, we use empowerment, a recently introduced biologically inspired measure, to allow an AI player to assign utility values to potential future states within a previously unencountered game without requiring explicit specification of goal states. We further introduce strategic affinity, a method of grouping action sequences together to form “strategies,” by examining the overlap in the sets of potential future states following each such action sequence. We also demonstrate an information-theoretic method of predicting future utility. Combining these methods, we extend empowerment to soft-horizon empowerment which enables the player to select a repertoire of action sequences that aim to maintain anticipated utility. We show how this method provides a proto-heuristic for nonterminal states prior to specifying concrete game goals, and propose it as a principled candidate model for “intuitive” strategy selection, in line with other recent work on “self-motivated agent behavior.” We demonstrate that the technique, despite being generically defined independently of scenario, performs quite well in relatively disparate scenarios, such as a Sokoban-inspired box-pushing scenario and in a Pac-Man-inspired predator game, suggesting novel and principle-based candidate routes toward more general game-playing algorithms.
  • Keywords
    artificial intelligence; computer games; information theory; AI player; Pac-Man-inspired predator game; Sokoban-inspired box-pushing scenario; anticipated utility; biologically inspired measure; concrete game goal; future utility; game-playing algorithm; information-theoretic method; intuitive strategy selection; nonterminal states; principle-based candidate; principled candidate model; self-motivated agent behavior; self-motivation; soft-horizon empowerment; strategic affinity; strategy identification; utility values; Artificial intelligence; Cognition; Computers; Entropy; Games; Mutual information; Random variables; Artificial intelligence (AI); games; information theory;
  • fLanguage
    English
  • Journal_Title
    Computational Intelligence and AI in Games, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1943-068X
  • Type

    jour

  • DOI
    10.1109/TCIAIG.2013.2295372
  • Filename
    6687219