Title : 
How to explore your opponent´s strategy (almost) optimally
         
        
            Author : 
Carmel, David ; Markovitch, Shad
         
        
            Author_Institution : 
Dept. of Comput. Sci., Technion-Israel Inst. of Technol., Haifa, Israel
         
        
        
        
        
        
            Abstract : 
Presents a lookahead-based exploration strategy for a model-based learning agent that enables exploration of the opponent´s behavior during interaction in a multi-agent system. Instead of holding one model, the model-based agent maintains a mixed opponent model, a distribution over a set of models that reflects its uncertainty about the opponent´s strategy. Every action is evaluated according to its long run contribution to the expected utility and to the knowledge regarding the opponent´s strategy. We present an efficient algorithm that returns an almost optimal exploration strategy against a given mixed model, and a learning method for acquiring a mixed model consistent with the opponent´s past behavior. We report experimental results in the Iterated Prisoner´s Dilemma game that demonstrate the superiority of the lookahead-based exploration strategy over other exploration methods
         
        
            Keywords : 
game theory; learning automata; software agents; Iterated Prisoner´s Dilemma game; exploration strategy; lookahead; mixed opponent model; model-based learning agent; multi-agent system; Books; Computer science; Costs; Economic forecasting; History; Laboratories; Learning automata; Learning systems; Multiagent systems; Neural networks; Power generation economics; Predictive models; Uncertainty; Utility theory;
         
        
        
        
            Conference_Titel : 
Multi Agent Systems, 1998. Proceedings. International Conference on
         
        
            Conference_Location : 
Paris
         
        
            Print_ISBN : 
0-8186-8500-X
         
        
        
            DOI : 
10.1109/ICMAS.1998.699033