• DocumentCode
    3728143
  • Title

    Rewarding Air Combat Behavior in Training Simulations

  • Author

    Armon Toubman;Jan Joris Roessingh;Pieter Spronck;Aske Plaat;Jaap van den Herik

  • Author_Institution
    Dept. of Training, Simulation, &
  • fYear
    2015
  • Firstpage
    1397
  • Lastpage
    1402
  • Abstract
    Computer generated forces (CGFs) inhabiting air combat training simulations must show realistic and adaptive behavior to effectively perform their roles as allies and adversaries. In earlier work, behavior for these CGFs was successfully generated using reinforcement learning. However, due to missile hits being subject to chance (a.k.a. The probability of-kill), the CGFs have in certain cases been improperly rewarded and punished. We surmise that taking this probability of-kill into account in the reward function will improve performance. To remedy the false rewards and punishments, a new reward function is proposed that rewards agents based on the expected outcome of their actions. Tests show that the use of this function significantly increases the performance of the CGFs in various scenarios, compared to the previous reward function and a naïve baseline. Based on the results, the new reward function allows the CGFs to generate more intelligent behavior, which enables better training simulations.
  • Keywords
    "Missiles","Atmospheric modeling","Training","Computational modeling","Adaptation models","Radar","Fires"
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics (SMC), 2015 IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/SMC.2015.248
  • Filename
    7379380