Title :
A non-myopic approach based on reinforcement learning for multiple moving targets search
Author :
Xu, Yifan ; Tan, Yuejin ; Lian, Zhenyu ; He, Renjie
Author_Institution :
Coll. of Inf. Syst. & Manage., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
Myopic information-based approaches maximizing information gain for single one observation opportunity are effective to search for multiple moving targets in ocean surveillance by space-based sensors. A non-myopic approach based on reinforcement learning is developed in order to maximize information gain for the long term. Reinforcement learning adjusts optimal control policy and learns system behaviors through trial-and-error experience from interactions with a dynamic environment. System states are characterized by the expected information gain, action-value functions are estimated by online SARAR (lambda) algorithm and parameterized control policy is approximated by neural networks. Finally, simulations show that non-myopic approach after sufficient training can provide better performance than myopic approach.
Keywords :
computer vision; learning (artificial intelligence); motion compensation; neural nets; object detection; action value function; multiple moving targets search; myopic information based approach; neural networks; nonmyopic approach; ocean surveillance; online SARAR algorithm; optimal control policy; parameterized control policy; reinforcement learning; space based sensors; Entropy; Learning; Oceans; Resource management; Satellites; Sensor systems; State estimation; Surveillance; Target tracking; Uncertainty; maritime surveillance; multiple moving targets; optimal search theory; reinforcement learning; satellite;
Conference_Titel :
Information and Automation (ICIA), 2010 IEEE International Conference on
Conference_Location :
Harbin
Print_ISBN :
978-1-4244-5701-4
DOI :
10.1109/ICINFA.2010.5512235