• DocumentCode
    2723963
  • Title

    Approximation Algorithms for Correlated Knapsacks and Non-martingale Bandits

  • Author

    Gupta, Anupam ; Krishnaswamy, Ravishankar ; Molinaro, Marco ; Ravi, R.

  • Author_Institution
    Comput. Sci. Dept., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    2011
  • fDate
    22-25 Oct. 2011
  • Firstpage
    827
  • Lastpage
    836
  • Abstract
    In the stochastic knapsack problem, we are given a knapsack of size B, and a set of items whose sizes and rewards are drawn from a known probability distribution. To know the actual size and reward we have to schedule the item-when it completes, we get to know these values. The goal is to schedule the items (possibly making adaptive decisions based on the sizes seen so far) to maximize the expected total reward of items which successfully pack into the knapsack. We know constant-factor approximations when (i) the rewards and sizes are independent, and (ii) we cannot prematurely cancel items after we schedule them. What if either or both assumptions are relaxed? Related stochastic packing problems are the multi-armed bandit (and budgeted learning) problems, here one is given several arms which evolve in a specified stochastic fashion with each pull, and the goal is to (adaptively) decide which arms to pull, in order to maximize the expected reward obtained after B pulls in total. Much recent work on this problem focuses on the case when the evolution of each arm follows a martingale, i.e., when the expected reward from one pull of an arm is the same as the reward at the current state. What if the rewards do not form a martingale? In this paper, we give O(1)-approximation algorithms for the stochastic knapsack problem with correlations and/or cancellations. Extending the ideas developed here, we give O(1)-approximations for MAB problems without the martingale assumption. Indeed, we can show that previously proposed linear programming relaxations for these problems have large integrality gaps. So we propose new time-indexed LP relaxations, using a decomposition and "gap-filling" approach, we convert these fractional solutions to distributions over strategies, and then use the LP values and the time ordering information from these strategies to devise randomized adaptive scheduling algorithms.
  • Keywords
    approximation theory; bin packing; computational complexity; knapsack problems; probability; randomised algorithms; stochastic processes; MAB problems; O(1)-approximation algorithms; constant-factor approximations; correlated knapsacks; nonmartingale bandits; probability distribution; randomized adaptive scheduling algorithms; stochastic knapsack problem; time-indexed LP relaxations; Approximation algorithms; Approximation methods; Markov processes; Optimized production technology; Random variables; Silicon; Approximation Algorithms; Multi-Armed Bandits; Stochastic Optimization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Foundations of Computer Science (FOCS), 2011 IEEE 52nd Annual Symposium on
  • Conference_Location
    Palm Springs, CA
  • ISSN
    0272-5428
  • Print_ISBN
    978-1-4577-1843-4
  • Type

    conf

  • DOI
    10.1109/FOCS.2011.48
  • Filename
    6108253