DocumentCode :
3561942
Title :
Branching bandits and Klimov´s problem: achievable region and side constraints
Author :
Bertsimas, Dimitris ; Paschalidis, Ioannis Ch ; Tsitsiklis, John N.
Author_Institution :
Lab. for Inf. & Decision Syst., MIT, Cambridge, MA, USA
Volume :
1
fYear :
1994
Firstpage :
174
Abstract :
Considers the average cost branching bandits problem and its special case known as Klimov´s problem. The authors consider the vector n whose components are the mean number of bandits (or customers) of each type that are present. The authors characterize fully the achievable region, that is, the set of all possible vectors n that can be obtained by considering all possible policies. While the original description of the achievable region involves exponentially many constraints, the authors also develop an alternative description that involves only O(R 2) variables and constraints, where R is the number of bandit types (or customer classes). The authors then consider the problem of minimizing a linear function of n subject to L additional linear constraints on n. The authors show that optimal policies can be obtained by randomizing between L+1 strict priority policies that can be found efficiently (in polynomial time) using linear programming techniques
Keywords :
discrete event systems; game theory; linear programming; queueing theory; stochastic systems; Klimov´s problem; achievable region; average cost branching bandits problem; linear function minimisation; linear programming techniques; optimal policies; side constraints; strict priority policies; Control systems; Cost function; Feedback; Laboratories; Linear programming; Polynomials; Stochastic processes; Stochastic systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control, 1994., Proceedings of the 33rd IEEE Conference on
Print_ISBN :
0-7803-1968-0
Type :
conf
DOI :
10.1109/CDC.1994.411026
Filename :
411026
Link To Document :
بازگشت