Nearly Minimax Optimal Regret for Multinomial Logistic Bandit

Neural Information Processing Systems 

Hence, the following research questions arise: What is the optimal regret lower bound in contextual MNL bandits?

Similar Docs  Excel Report  more

TitleSimilaritySource
None found