Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization

Open in new window