Nearly Minimax-Optimal Regret for Linearly Parameterized Bandits
