Almost Optimal Batch-Regret Tradeoff for Batch Linear Contextual Bandits

Open in new window