• DocumentCode
    423960
  • Title

    A reinforcement learning algorithm to improve scheduling search heuristics with the SVM

  • Author

    Gersmann, Kai ; Hammer, Barbara

  • Author_Institution
    Dept. of Math. & Comput. Sci., Osnabruck Univ., Germany
  • Volume
    3
  • fYear
    2004
  • fDate
    25-29 July 2004
  • Firstpage
    1811
  • Abstract
    A regret-based biased random sampling scheme (RBRS) is a simple but powerful priority-rule based method to solve the resource constrained project scheduling problem (RCPSP), a well-known NP-hard benchmark problem. We present a generic machine learning method to improve results of RBRS. The rout-algorithm of reinforcement learning is combined with the support vector machine (SVM) to learn an appropriate value function which guides the search strategy given by RBRS. The specific properties of the SVM allow to reduce the size of the training set and show improved results even after a short period of training as demonstrated for benchmark instances of the RCPSP.
  • Keywords
    computational complexity; function approximation; learning (artificial intelligence); optimisation; random processes; sampling methods; scheduling; search problems; support vector machines; NP hard benchmark problem; SVM; generic machine learning method; regret based biased random sampling; reinforcement learning algorithm; resource constrained project scheduling problem; rout algorithm; rule based method; search heuristic scheduling; support vector machine; training set; value function approximation; Function approximation; Iterative algorithms; Learning systems; Machine learning; NASA; Payloads; Sampling methods; Scheduling algorithm; Space shuttles; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
  • ISSN
    1098-7576
  • Print_ISBN
    0-7803-8359-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2004.1380883
  • Filename
    1380883