Piecewise-Stationary Bandits with Knapsacks

Neural Information Processing Systems 

We propose a novel inventory reserving algorithm which draws new insights into Bandits with Knapsacks (Bwk) problems in piecewise-stationary environments.