DocumentCode
2313108
Title
Approximate Q-Learning: An Introduction
Author
Pandey, Deepshikha ; Pandey, Punit
Author_Institution
Dept. of Comput. Sci. & Eng., Jaypee Inst. of Eng. & Technol., Guna, India
fYear
2010
fDate
9-11 Feb. 2010
Firstpage
317
Lastpage
320
Abstract
This paper introduces an approach to Q-learning algorithm with rough set theory introduced by Zdzislaw Pawlak in 1981. During Q-learning, an agent makes action selections in an effort to maximize a reward signal obtained from the environment. Based on reward, agent will make changes in its policy for future actions. The problem considered in this paper is the overestimation of expected value of cumulative future discounted rewards. This discounted reward is used in evaluating agent actions and policy during reinforcement learning. Due to the overestimation of discounted reward action evaluation and policy changes are not accurate. The solution to this problem results from a form Q-learning algorithm using a combination of approximation spaces and Q-learning to estimate the expected value of returns on actions. This is made possible by considering behavior patterns of an agent in scope of approximation spaces. The framework provided by an approximation space makes it possible to measure the degree that agent behaviors are a part of (´´covered by´´) a set of accepted agent behaviors that serve as a behavior evaluation norm.
Keywords
learning (artificial intelligence); rough set theory; Q-learning algorithm; Zdzislaw Pawlak; behavior patterns; reinforcement learning; reward action evaluation; rough set theory; Approximation algorithms; Computer science; Extraterrestrial measurements; Learning systems; Machine learning; Machine learning algorithms; Optimization methods; Paper technology; Set theory; State estimation; Approximate Q-learning Algorithm; Overestimation of Q-Values; Q-learning Algorithm; Reinforcement Learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Computing (ICMLC), 2010 Second International Conference on
Conference_Location
Bangalore
Print_ISBN
978-1-4244-6006-9
Electronic_ISBN
978-1-4244-6007-6
Type
conf
DOI
10.1109/ICMLC.2010.38
Filename
5460718
Link To Document