Knapsack based Optimal Policies for Budget-Limited Multi-Armed Bandits

Open in new window