Knapsack Based Optimal Policies for Budget–Limited Multi–Armed Bandits
