DocumentCode :
170371
Title :
Learning in hide-and-seek
Author :
Qingsi Wang ; Mingyan Liu
Author_Institution :
EECS Dept., Univ. of Michigan, Ann Arbor, MI, USA
fYear :
2014
fDate :
April 27 2014-May 2 2014
Firstpage :
217
Lastpage :
225
Abstract :
Existing work on pursuit-evasion problems typically either assumes stationary or heuristic behavior of one side and examines countermeasures of the other, or assumes both sides to be strategic, which leads to a game-theoretical framework. Results from the former may lack robustness against changes in the adversarial behavior, while those from the latter are often difficult to justify due to the implied full information (either as realizations or as distributions) and rationality, both of which may be limited in practice. In this paper, we take a different approach by assuming an intelligent pursuer/evader that is adaptive to the information available to it and is capable of learning over time with performance guarantees. Within this context we investigate two cases. In the first case we assume either the evader or the pursuer is aware of the type of learning algorithm used by the opponent, while in the second case neither side has such information and thus must try to learn. We show that the optimal policies in the first case have a greedy nature, hiding/seeking in the location where the opponent is the least/most likely to appear. This result is then used to assess the performance of the learning algorithms that both sides employ in the second case, which are shown to be mutually optimal: neither side incurs a loss compared to the case in which it completely knows the adaptive pattern used by the adversary and responds optimally.
Keywords :
game theory; learning (artificial intelligence); adaptive pattern; adversarial behavior; game theoretical framework; greedy nature; hide-and-seek; intelligent pursuer-evader; learning algorithm; optimal policies; performance guarantee; pursuit-evasion problems; Adaptation models; Computational modeling; Computers; Conferences; Context; Games; Probabilistic logic;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
INFOCOM, 2014 Proceedings IEEE
Conference_Location :
Toronto, ON
Type :
conf
DOI :
10.1109/INFOCOM.2014.6847942
Filename :
6847942