Title :
Guiding Autonomous Agents to Better Behaviors through Human Advice
Author :
Kunapuli, Gautam ; Odom, Phillip ; Shavlik, Jude W. ; Natarajan, Sriraam
Author_Institution :
Dept. of Biostat. & Med. Inf., Univ. of Wisconsin-Madison, Madison, WI, USA
Abstract :
Inverse Reinforcement Learning (IRL) is an approach to domain-reward discovery from demonstration, in which an agent recovers the reward function of a Markov decision process by observing an expert acting in the domain. The standard setting assumes that the expert acts (nearly) optimally and that a large number of trajectories, i.e., training examples, are available for reward discovery (and, consequently, for learning domain behavior). Neither assumption is practical: trajectories are often noisy, and examples can be scarce. Our novel approach incorporates advice-giving into the IRL framework to address these issues. Inspired by preference elicitation, a domain expert provides advice on states and actions (features) by stating preferences over them. We evaluate our approach on several domains and show that, with small amounts of targeted preference advice, learning is possible from noisy demonstrations and requires far fewer trajectories than learning from trajectories alone.
Keywords :
Markov processes; decision theory; learning (artificial intelligence); software agents; IRL framework; Markov decision process; autonomous agents; domain expert; domain-reward discovery; human advice; inverse reinforcement learning; learning domain behavior; preference elicitation; reward function; data mining; trajectory
Conference_Title :
2013 IEEE 13th International Conference on Data Mining (ICDM)
Conference_Location :
Dallas, TX
DOI :
10.1109/ICDM.2013.79