A reinforcement learning algorithm to improve scheduling search heuristics with the SVM

Author

Gersmann, Kai ; Hammer, Barbara

Author_Institution

Dept. of Math. & Comput. Sci., Osnabruck Univ., Germany

Volume

3

fYear

2004

fDate

25-29 July 2004

Firstpage

1811

Abstract

A regret-based biased random sampling scheme (RBRS) is a simple but powerful priority-rule based method to solve the resource constrained project scheduling problem (RCPSP), a well-known NP-hard benchmark problem. We present a generic machine learning method to improve results of RBRS. The rout-algorithm of reinforcement learning is combined with the support vector machine (SVM) to learn an appropriate value function which guides the search strategy given by RBRS. The specific properties of the SVM allow to reduce the size of the training set and show improved results even after a short period of training as demonstrated for benchmark instances of the RCPSP.

Keywords

computational complexity; function approximation; learning (artificial intelligence); optimisation; random processes; sampling methods; scheduling; search problems; support vector machines; NP hard benchmark problem; SVM; generic machine learning method; regret based biased random sampling; reinforcement learning algorithm; resource constrained project scheduling problem; rout algorithm; rule based method; search heuristic scheduling; support vector machine; training set; value function approximation; Function approximation; Iterative algorithms; Learning systems; Machine learning; NASA; Payloads; Sampling methods; Scheduling algorithm; Space shuttles; Support vector machines;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on

ISSN

1098-7576

Print_ISBN

0-7803-8359-1

Type

conf

DOI

10.1109/IJCNN.2004.1380883

Filename

1380883