DocumentCode :
3642752
Title :
Implementation of Q — Learning algorithm for solving maze problem
Author :
D. Osmanković;S. Konjicija
Author_Institution :
Department of Automatic Control and Electronics, University in Sarajevo, Faculty of Electrical Engineering in Sarajevo, Zmaja od Bosne bb, 71000 Sarajevo, Bosnia and Herzegovina
fYear :
2011
fDate :
5/1/2011 12:00:00 AM
Firstpage :
1619
Lastpage :
1622
Abstract :
Machine learning is very important in several fields ranging from control systems to data mining. This paper presents Q - Learning implementation for abstract graph models with maze solving (finding the trajectory out of the maze) taken as example of graph problem. The paper consists of conversion of maze matrix to Q - Learning reward matrix, and also the implementation of Q - Learning algorithm for the reward matrix (similar to minimizing criteria matrix in dynamic programming). This implementation is on higher level of abstraction, so other representations can be used (artificial neural networks, tree etc.). For the testing of Q - Learning algorithm, maze solving problem was visualized in MATLAB programming language with the found trajectory marked on the maze. The maze in this paper is defined with starting position in the top left corner and the exit in the bottom right corner. The performance of the algorithm is measured for different scales of the problem.
Publisher :
ieee
Conference_Titel :
MIPRO, 2011 Proceedings of the 34th International Convention
Print_ISBN :
978-1-4577-0996-8
Type :
conf
Filename :
5967320
Link To Document :
بازگشت