Non-monotonic Resource Utilization in the Bandits with Knapsacks Problem

Sep-24-2022–arXiv.org Artificial Intelligence

Bandits with knapsacks (BwK) is an influential model of sequential decision-making under uncertainty that incorporates resource consumption constraints. In each round, the decision-maker observes an outcome consisting of a reward and a vector of nonnegative resource consumptions, and the budget of each resource is decremented by its consumption. In this paper we introduce a natural generalization of the stochastic BwK problem that allows non-monotonic resource utilization. In each round, the decision-maker observes an outcome consisting of a reward and a vector of resource drifts that can be positive, negative or zero, and the budget of each resource is incremented by its drift. Our main result is a Markov decision process (MDP) policy that has constant regret against a linear programming (LP) relaxation when the decision-maker knows the true outcome distributions. We build upon this to develop a learning algorithm that has logarithmic regret against the same LP relaxation when the decision-maker does not know the true outcome distributions. We also present a reduction from BwK to our model that shows our regret bound matches existing results.

budget, data mining, machine learning, (20 more...)

arXiv.org Artificial Intelligence

Sep-24-2022

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York > Tompkins County > Ithaca (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.46)
  - Artificial Intelligence
    - Representation & Reasoning
      - Optimization (0.66)
      - Constraint-Based Reasoning (0.46)
    - Machine Learning
      - Computational Learning Theory (0.64)
      - Learning Graphical Models > Undirected Networks
        Markov Models (0.55)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found