DocumentCode
2553682
Title
Randomized allocation with dimension reduction in a bandit problem with covariates
Author
Qian, Wei ; Yang, Yuhong
Author_Institution
Sch. of Stat., Univ. of Minnesota, Minneapolis, MN, USA
fYear
2012
fDate
29-31 May 2012
Firstpage
1537
Lastpage
1541
Abstract
Multi-armed bandit problem is an important optimization game requiring an exploration-exploitation tradeoff to achieve optimal total reward. We consider a setting where the rewards of bandit machines are associated with covariates, and focus on the approach of nonparametric estimation of the reward functions together with a randomized allocation to balance the exploration and exploitation. To overcome the curse of dimensionality in nonparametric learning, we propose using dimension reduction methods such as sliced inverse regression (SIR) and likelihood acquired directions (LAD) to reduce the dimension of the covariates. To simultaneously achieve variable selection and dimension reduction, we use coordinate-independent sparse estimation (CISE) for the dimension reduction step. Not knowing which individual dimension reduction method is the best, we show that adaptively combining these dimension reduction methods works really well.
Keywords
covariance analysis; game theory; optimisation; random processes; regression analysis; CISE; LAD; SIR; bandit machine; coordinate-independent sparse estimation; covariates; curse of dimensionality; dimension reduction; exploration-exploitation tradeoff; likelihood acquired direction; multiarmed bandit problem; nonparametric estimation; nonparametric learning; optimal total reward; optimization game; randomized allocation; reward function; sliced inverse regression; variable selection; Analytical models; Covariance matrix; Educational institutions; Estimation; Kernel; Resource management; Zinc;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
Conference_Location
Sichuan
Print_ISBN
978-1-4673-0025-4
Type
conf
DOI
10.1109/FSKD.2012.6234368
Filename
6234368
Link To Document