Title :
Finite-Time Performance of Some Two-Armed Bandit Controllers
Author_Institution :
Department of Electrical Engineering Science, University of Essex, Colchester, Essex, England.
fDate :
3/1/1973 12:00:00 AM
Abstract :
A class of asymptotically ¿-optimal two-armed bandit controllers is given, and two criteria for comparing the long-term finite-time performance of controllers in this class are proposed. The performances of three particular controllers are compared using the criteria, and the analysis is confirmed by computer iteration if the appropriate probability recurrence relations.
Keywords :
Arm; History; Optimal control; Proportional control; Shape control; Upper bound;
Journal_Title :
Systems, Man and Cybernetics, IEEE Transactions on
DOI :
10.1109/TSMC.1973.5408504