Improved Online Confidence Bounds for Multinomial Logistic Bandits
