Title :
The apprentice modeling through reinforcement with a temporal analysis using the Q-learning algorithm
Author :
Guelpeli, Marcus Vinicius C ; Pinto, Márcia Aurélia ; De Oliveira, Bruno Santos ; Santos, Ruana Carpanzano dos
Author_Institution :
Centro Univ. de Barra Mansa (UBM), Barra Mansa, Brazil
Abstract :
This work aims to create the simulations by varying the alpha (a - Learning rate) and Gamma (y - Time reduction) values, such parameters found in the q-learning algorithm, which is possible to analyze the algorithms convergence, on what concerns the variations of these parameters. This work seeks to state that the parameters variations of Alpha and Gamma interfere on the convergence of Q-learning algorithm, thus, in the ITS learning.
Keywords :
intelligent tutoring systems; learning (artificial intelligence); ITS learning; alpha values; apprentice modeling; gamma values; q-learning algorithm; temporal analysis; Adaptation models; Analytical models; Computational modeling; Convergence; Learning systems; Machine learning; Learning reinforcement; Machine learning; Q-learning; intelligence tutoring system;
Conference_Titel :
Computer Science and Automation Engineering (CSAE), 2012 IEEE International Conference on
Conference_Location :
Zhangjiajie
Print_ISBN :
978-1-4673-0088-9
DOI :
10.1109/CSAE.2012.6272601