Bandit Task Assignment with Unknown Processing Time

Neural Information Processing Systems 

This study considers a novel problem setting, referred to as bandit task assignment, that incorporates the processing time of each task in the bandit setting. In this problem setting, a player sequentially chooses a set of tasks to start so that the set of processing tasks satisfies a given combinatorial constraint. The reward and processing time for each task follow unknown distributions, values of which are revealed only after the task has been completed.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found