Sequential Batch Learning in Finite-Action Linear Contextual Bandits

Open in new window