DocumentCode :
3438936
Title :
Stochastic bandits with pathwise constraints
Author :
Avner, Orly ; Mannor, Shie
Author_Institution :
Technion - Israel Inst. of Technol., Haifa, Israel
fYear :
2011
fDate :
12-15 Dec. 2011
Firstpage :
3862
Lastpage :
3869
Abstract :
We consider the problem of stochastic bandits, with the goal of maximizing a reward while satisfying pathwise constraints. The motivation for this problem comes from cognitive radio networks, in which agents need to choose between different transmission profiles to maximize throughput under certain operational constraints such as limited average power. Stochastic bandits serve as a natural model for an unknown, stationary environment. We propose an algorithm, based on a steering approach, and analyze its regret with respect to the optimal stationary policy that knows the statistics of the different arms.
Keywords :
cognitive radio; constraint satisfaction problems; constraint theory; stochastic processes; cognitive radio networks; optimal stationary policy; pathwise constraint satisfaction; steering approach; stochastic bandits; transmission profiles; Algorithm design and analysis; Cognitive radio; Convergence; Indexes; Loss measurement; Optimization; Stochastic processes;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Decision and Control and European Control Conference (CDC-ECC), 2011 50th IEEE Conference on
Conference_Location :
Orlando, FL
ISSN :
0743-1546
Print_ISBN :
978-1-61284-800-6
Electronic_ISBN :
0743-1546
Type :
conf
DOI :
10.1109/CDC.2011.6161093
Filename :
6161093