DocumentCode :
1634415
Title :
Linear bandits in high dimension and recommendation systems
Author :
Deshpande, Yateendra ; Montanari, Alessandro
Author_Institution :
Dept. of Electr. Eng., Stanford Univ., Stanford, CA, USA
fYear :
2012
Firstpage :
1750
Lastpage :
1754
Abstract :
A large number of online services provide automated recommendations to help users to navigate through a large collection of items. New items (products, videos, songs, advertisements) are suggested on the basis of the user´s past history and - when available - her demographic profile. Recommendations have to satisfy the dual goal of helping the user to explore the space of available items, while allowing the system to probe the user´s preferences. We model this trade-off using linearly parametrized multi-armed bandits and prove upper and lower bounds that coincide up to constants in the data poor (high-dimensional) regime. We test (a variation of) the scheme used for estabilishing achievability on the Netflix dataset, and obtain results in agreement with the theory.
Keywords :
information services; recommender systems; sequential estimation; Netflix dataset; automated recommendations; data poor regime; demographic profile; linear bandits; linearly parametrized multiarmed bandits; lower bounds; online services; recommendation systems; upper bounds; user past history; user preferences; Ellipsoids; History; Motion pictures; Noise; Numerical simulation; Upper bound; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communication, Control, and Computing (Allerton), 2012 50th Annual Allerton Conference on
Conference_Location :
Monticello, IL
Print_ISBN :
978-1-4673-4537-8
Type :
conf
DOI :
10.1109/Allerton.2012.6483433
Filename :
6483433
Link To Document :
بازگشت