Thompson Sampling for Contextual Bandits with Linear Payoffs

Open in new window