DocumentCode
70622
Title
Random-Walk Perturbations for Online Combinatorial Optimization
Author
Devroye, Luc ; Lugosi, Gabor ; Neu, Gergely
Author_Institution
Sch. of Comput. Sci., McGill Univ., Montreal, QC, Canada
Volume
61
Issue
7
fYear
2015
fDate
Jul-15
Firstpage
4099
Lastpage
4106
Abstract
We study online combinatorial optimization problems that a learner is interested in minimizing its cumulative regret in the presence of switching costs. To solve such problems, we propose a version of the follow-the-perturbed-leader algorithm in which the cumulative losses are perturbed by independent symmetric random walks. In the general setting, our forecaster is shown to enjoy near-optimal guarantees on both quantities of interest, making it the best known efficient algorithm for the studied problem. In the special case of prediction with expert advice, we show that the forecaster achieves an expected regret of the optimal order O(n log N)1/2), where n is the time horizon and N is the number of experts, while guaranteeing that the predictions are switched at most O(n log N)1/2) times, in expectation.
Keywords
combinatorial mathematics; optimisation; cumulative regret; expert advice; follow-the-perturbed-leader algorithm; independent symmetric random walks; online combinatorial optimization; random-walk perturbations; switching costs; Algorithm design and analysis; Optimization; Prediction algorithms; Random variables; Standards; Switches; Upper bound; Follow the Perturbed Leader; Online combinatorial optimization; Online learning; Random walk; follow the perturbed leader; online combinatorial; optimization; random walk;
fLanguage
English
Journal_Title
Information Theory, IEEE Transactions on
Publisher
ieee
ISSN
0018-9448
Type
jour
DOI
10.1109/TIT.2015.2428253
Filename
7110333
Link To Document