Bandit Task Assignment with Unknown Processing Time
–Neural Information Processing Systems
This study considers a novel problem setting, referred to as bandit task assignment, that incorporates the processing time of each task in the bandit setting. In this problem setting, a player sequentially chooses a set of tasks to start so that the set of processing tasks satisfies a given combinatorial constraint. The reward and processing time for each task follow unknown distributions, values of which are revealed only after the task has been completed.
Neural Information Processing Systems
Apr-25-2026, 14:05:55 GMT
- Technology: