Title : 
Learning automata in games with memory with application to circuit-switched routing
         
        
        
            Author_Institution : 
Dept. of Electr. & Comput. Eng., Boston Univ., MA, USA
         
        
        
        
        
        
            Abstract : 
A general setting is considered in which autonomous users interact by means of a finite-state controlled Markov process. This process is driven by the collective actions of all users, and individual users receive separate rewards according to its state. It is assumed that each user chooses its actions via a reinforcement learning algorithm based on its local information. The dynamic behavior of user strategies is characterized for small values of a step-size parameter adopted in learning. The general form of equilibria is obtained and is shown to be analogous to Wardrop equilibria if users update their strategies on a faster time-scale compared to the underlying process. The results are illustrated in the context of routing in circuit-switched communication networks.
         
        
            Keywords : 
Markov processes; game theory; learning automata; telecommunication network routing; Wardrop equilibria; autonomous users; circuit-switched communication networks; circuit-switched routing; finite-state controlled Markov process; learning automata; reinforcement learning algorithm; step-size parameter; Automatic control; Circuits; Communication networks; Communication system control; Context modeling; History; Learning automata; Markov processes; Process control; Routing;
         
        
        
        
            Conference_Titel : 
Decision and Control, 2004. CDC. 43rd IEEE Conference on
         
        
        
            Print_ISBN : 
0-7803-8682-5
         
        
        
            DOI : 
10.1109/CDC.2004.1429565