Approximate Q-Learning: An Introduction

Author

Pandey, Deepshikha ; Pandey, Punit

Author_Institution

Dept. of Comput. Sci. & Eng., Jaypee Inst. of Eng. & Technol., Guna, India

fYear

2010

fDate

9-11 Feb. 2010

Firstpage

317

Lastpage

320

Abstract

This paper introduces an approach to Q-learning algorithm with rough set theory introduced by Zdzislaw Pawlak in 1981. During Q-learning, an agent makes action selections in an effort to maximize a reward signal obtained from the environment. Based on reward, agent will make changes in its policy for future actions. The problem considered in this paper is the overestimation of expected value of cumulative future discounted rewards. This discounted reward is used in evaluating agent actions and policy during reinforcement learning. Due to the overestimation of discounted reward action evaluation and policy changes are not accurate. The solution to this problem results from a form Q-learning algorithm using a combination of approximation spaces and Q-learning to estimate the expected value of returns on actions. This is made possible by considering behavior patterns of an agent in scope of approximation spaces. The framework provided by an approximation space makes it possible to measure the degree that agent behaviors are a part of (´´covered by´´) a set of accepted agent behaviors that serve as a behavior evaluation norm.

Keywords

learning (artificial intelligence); rough set theory; Q-learning algorithm; Zdzislaw Pawlak; behavior patterns; reinforcement learning; reward action evaluation; rough set theory; Approximation algorithms; Computer science; Extraterrestrial measurements; Learning systems; Machine learning; Machine learning algorithms; Optimization methods; Paper technology; Set theory; State estimation; Approximate Q-learning Algorithm; Overestimation of Q-Values; Q-learning Algorithm; Reinforcement Learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Machine Learning and Computing (ICMLC), 2010 Second International Conference on

Conference_Location

Bangalore

Print_ISBN

978-1-4244-6006-9

Electronic_ISBN

978-1-4244-6007-6

Type

conf

DOI

10.1109/ICMLC.2010.38

Filename

5460718