Linear Multi-Resource Allocation with Semi-Bandit Feedback

Lattimore, Tor, Crammer, Koby, Szepesvari, Csaba

Dec-31-2015–Neural Information Processing Systems

We study an idealised sequential resource allocation problem. In each time step the learner chooses an allocation of several resource types between a number of tasks. Assigning more resources to a task increases the probability that it is completed. The problem is challenging because the alignment of the tasks to the resource types is unknown and the feedback is noisy. Our main contributions are the new setting and an algorithm with a nearly-optimal regret analysis. Along the way we draw connections to the problem of minimising regret for stochastic linear bandits with heteroscedastic noise. We also present some new results for stochastic linear bandits on the hypercube that significantly outperform existing work, especially in the sparse case.

algorithm, allocation, artificial intelligence

Country:
- North America > Canada > Alberta (0.14)

Genre:
- Research Report > New Finding (0.66)

Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
