Thompson Sampling for Multinomial Logit Contextual Bandits
