UCB-based Algorithms for Multinomial Logistic Regression Bandits

Apr-24-2026, 21:31:03 GMT–Neural Information Processing Systems

Out of the rich family of generalized linear bandits, perhaps the most well studied ones are logistic bandits that are used in problems with binary rewards: for instance, when the learner aims to maximize the profit over a user that can select one of two possible outcomes (e.g., 'click' vs'no-click'). Despite remarkable recent progress and improved algorithms for logistic bandits, existing works do not address practical situations where the number of outcomes that can be selected by the user is larger than two (e.g., 'click', 'show me later', 'never show again', 'no click'). In this paper, we study such an extension. We use multinomial logit (MNL) to model the probability of each one of K+1 2possible outcomes (+1 stands for the'not click' outcome): we assume that for a learner's action xt, the user selects one of K +1 2outcomes, say outcome i, with a MNL probabilistic model with corresponding unknown parameter θ i. Each outcome i is also associated with a revenue parameter ρi and the goal is to maximize the expected revenue. For this problem, we present MNL-UCB, an upper confidence bound (UCB)-based algorithm, that achieves regret O(dK T) with small dependency on problemdependent constants that can otherwise be arbitrarily large and lead to loose regret bounds. We present numerical simulations that corroborate our theoretical results.

algorithm, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Apr-24-2026, 21:31:03 GMT

Conferences PDF

Add feedback

Country:
- North America > United States > California (0.46)

Genre:
- Research Report
  - New Finding (0.82)
  - Experimental Study (0.50)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (0.83)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Uncertainty (0.48)
  - Machine Learning > Statistical Learning
    - Regression (0.40)

Duplicate Docs Excel Report

Title
16f852a6d01b6065c8ff5cc11caae9c6-Supplemental.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found