Title :
Inverse Reinforcement Learning using Expectation Maximization in mixture models
Author :
Hahn, Jürgen; Zoubir, Abdelhak M.
Author_Institution :
Signal Processing Group, Technische Universität Darmstadt, Darmstadt, Germany
Abstract :
Reinforcement Learning (RL) is an attractive tool for learning controllers that are optimal with respect to a given reward function. In conventional RL, an expert is usually required to design the reward function, since the performance of RL depends strongly on it. Inverse Reinforcement Learning (IRL) offers an alternative in which the reward function is estimated from observed data. In this work, we propose a novel approach to IRL based on a generative probabilistic model of RL. We derive an Expectation Maximization algorithm that simultaneously estimates the reward and the optimal policy for finite state and action spaces, and which can easily be extended to the infinite case. By means of two toy examples, we show that the proposed algorithm works well even with a small number of observations and converges after only a few iterations.
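The abstract does not spell out the generative model, so the following is only a rough numpy sketch of the kind of alternating scheme it describes for a finite MDP: an "E-step" that computes a soft-optimal policy under the current reward estimate, and an "M-step" that updates the reward from the mismatch between demonstrated and predicted state visitations (a MaxEnt-IRL-style gradient standing in for the paper's mixture-model update). The MDP, the demonstration data, and the step size are all invented for illustration and are not taken from the paper.

```python
import numpy as np

# Hypothetical toy MDP; all sizes and the demo data are illustrative.
rng = np.random.default_rng(0)
n_states, n_actions, gamma = 5, 2, 0.9

# Random row-stochastic transition model P[a, s, s'].
P = rng.random((n_actions, n_states, n_states))
P /= P.sum(axis=2, keepdims=True)

# Observed expert demonstrations as (state, action) pairs.
demos = [(0, 1), (1, 1), (2, 0), (3, 1), (4, 0), (1, 1), (2, 0)]
emp_visits = np.bincount([s for s, _ in demos], minlength=n_states)
emp_visits = emp_visits / emp_visits.sum()

def soft_policy(R, n_sweeps=100):
    """Soft value iteration: stochastic policy pi[s, a] under reward R."""
    V = np.zeros(n_states)
    Q = np.zeros((n_states, n_actions))
    for _ in range(n_sweeps):
        Q = R[:, None] + gamma * (P @ V).T      # Q[s, a]
        V = np.log(np.exp(Q).sum(axis=1))       # soft maximum over actions
    return np.exp(Q - V[:, None])               # rows sum to one

def expected_visits(pi, horizon=20):
    """Average state occupancy when following pi from a uniform start."""
    d = np.full(n_states, 1.0 / n_states)
    total = np.zeros(n_states)
    for _ in range(horizon):
        total += d
        d = np.einsum('s,sa,ast->t', d, pi, P)  # one forward step
    return total / horizon

R = np.zeros(n_states)                          # flat initial reward
for it in range(200):
    pi = soft_policy(R)                         # "E-step": policy given R
    grad = emp_visits - expected_visits(pi)     # visitation mismatch
    R += 0.1 * grad                             # "M-step": reward update

print("estimated reward per state:", np.round(R, 2))
```

As in IRL generally, the reward is identifiable only up to shaping and scaling ambiguities, so only the relative values of the estimated R are meaningful in this sketch.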
Keywords :
expectation-maximisation algorithm; learning (artificial intelligence); mixture models; probability; IRL; action spaces; expectation maximization algorithm; finite state spaces; generative probabilistic model; inverse reinforcement learning; optimal controllers; optimal policy; reward function; Probabilistic logic; Expectation Maximization; Inverse Reinforcement Learning; Markov Decision Process
Conference_Title :
2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Conference_Location :
South Brisbane, QLD, Australia
DOI :
10.1109/ICASSP.2015.7178666