Title :
Randomized allocation with dimension reduction in a bandit problem with covariates
Author :
Qian, Wei ; Yang, Yuhong
Author_Institution :
Sch. of Stat., Univ. of Minnesota, Minneapolis, MN, USA
Abstract :
Multi-armed bandit problem is an important optimization game requiring an exploration-exploitation tradeoff to achieve optimal total reward. We consider a setting where the rewards of bandit machines are associated with covariates, and focus on the approach of nonparametric estimation of the reward functions together with a randomized allocation to balance the exploration and exploitation. To overcome the curse of dimensionality in nonparametric learning, we propose using dimension reduction methods such as sliced inverse regression (SIR) and likelihood acquired directions (LAD) to reduce the dimension of the covariates. To simultaneously achieve variable selection and dimension reduction, we use coordinate-independent sparse estimation (CISE) for the dimension reduction step. Not knowing which individual dimension reduction method is the best, we show that adaptively combining these dimension reduction methods works really well.
Keywords :
covariance analysis; game theory; optimisation; random processes; regression analysis; CISE; LAD; SIR; bandit machine; coordinate-independent sparse estimation; covariates; curse of dimensionality; dimension reduction; exploration-exploitation tradeoff; likelihood acquired direction; multiarmed bandit problem; nonparametric estimation; nonparametric learning; optimal total reward; optimization game; randomized allocation; reward function; sliced inverse regression; variable selection; Analytical models; Covariance matrix; Educational institutions; Estimation; Kernel; Resource management; Zinc;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
Conference_Location :
Sichuan
Print_ISBN :
978-1-4673-0025-4
DOI :
10.1109/FSKD.2012.6234368