Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles
