DocumentCode
423960
Title
A reinforcement learning algorithm to improve scheduling search heuristics with the SVM
Author
Gersmann, Kai ; Hammer, Barbara
Author_Institution
Dept. of Math. & Comput. Sci., Osnabruck Univ., Germany
Volume
3
fYear
2004
fDate
25-29 July 2004
Firstpage
1811
Abstract
A regret-based biased random sampling scheme (RBRS) is a simple but powerful priority-rule based method to solve the resource constrained project scheduling problem (RCPSP), a well-known NP-hard benchmark problem. We present a generic machine learning method to improve results of RBRS. The rout-algorithm of reinforcement learning is combined with the support vector machine (SVM) to learn an appropriate value function which guides the search strategy given by RBRS. The specific properties of the SVM allow to reduce the size of the training set and show improved results even after a short period of training as demonstrated for benchmark instances of the RCPSP.
Keywords
computational complexity; function approximation; learning (artificial intelligence); optimisation; random processes; sampling methods; scheduling; search problems; support vector machines; NP hard benchmark problem; SVM; generic machine learning method; regret based biased random sampling; reinforcement learning algorithm; resource constrained project scheduling problem; rout algorithm; rule based method; search heuristic scheduling; support vector machine; training set; value function approximation; Function approximation; Iterative algorithms; Learning systems; Machine learning; NASA; Payloads; Sampling methods; Scheduling algorithm; Space shuttles; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
ISSN
1098-7576
Print_ISBN
0-7803-8359-1
Type
conf
DOI
10.1109/IJCNN.2004.1380883
Filename
1380883
Link To Document