• DocumentCode
    14608
  • Title

    Action Priors for Learning Domain Invariances

  • Author

    Rosman, Benjamin ; Ramamoorthy, Subramanian

  • Author_Institution
    Mobile Intell. Autonomous Syst., Pretoria, South Africa
  • Volume
    7
  • Issue
    2
  • fYear
    2015
  • fDate
    Jun-15
  • Firstpage
    107
  • Lastpage
    118
  • Abstract
    An agent tasked with solving a number of different decision making problems in similar environments has an opportunity to learn over a longer timescale than each individual task. Through examining solutions to different tasks, it can uncover behavioral invariances in the domain, by identifying actions to be prioritized in local contexts, invariant to task details. This information has the effect of greatly increasing the speed of solving new problems. We formalise this notion as action priors, defined as distributions over the action space, conditioned on environment state, and show how these can be learnt from a set of value functions. We apply action priors in the setting of reinforcement learning, to bias action selection during exploration. Aggressive use of action priors performs context based pruning of the available actions, thus reducing the complexity of lookahead during search. We additionally define action priors over observation features, rather than states, which provides further flexibility and generalizability, with the additional benefit of enabling feature selection. Action priors are demonstrated in experiments in a simulated factory environment and a large random graph domain, and show significant speed ups in learning new tasks. Furthermore, we argue that this mechanism is cognitively plausible, and is compatible with findings from cognitive psychology.
  • Keywords
    decision making; feature selection; learning (artificial intelligence); action priors; action selection; behavioral invariances; cognitive psychology; context based pruning; decision making problems; domain invariance learning; feature selection; lookahead complexity; reinforcement learning; Buildings; Context; Decision making; Equations; Learning (artificial intelligence); Robots; Training; Action ordering; action selection; reinforcement learning; search pruning; transfer learning;
  • fLanguage
    English
  • Journal_Title
    Autonomous Mental Development, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1943-0604
  • Type

    jour

  • DOI
    10.1109/TAMD.2015.2419715
  • Filename
    7079487