Dynamic pricing and assortment under a contextual MNL demand

Apr-24-2026, 19:14:31 GMT–Neural Information Processing Systems

We consider dynamic multi-product pricing and assortment problems under an unknown demand over T periods, where in each period, the seller decides on the price for each product or the assortment of products to offer to a customer who chooses according to an unknown Multinomial Logit Model (MNL). Such problems arise in many applications, including online retail and advertising. We propose a randomized dynamic pricing policy based on a variant of the Online Newton Step algorithm (ONS) that achieves a O(d T log(T))regret guarantee under an adversarial arrival model. We also present a new optimistic algorithm for the adversarial MNL contextual bandits problem, which achieves a better dependency than the state-of-the-art algorithms in a problem-dependent constant κ2 (potentially exponentially small). Our regret upper bound scales as O(d κ2T +log(T)/κ2), which gives a stronger bound than the existing O(d T/κ2)guarantees.

artificial intelligence, data mining, machine learning, (16 more...)

Neural Information Processing Systems

Apr-24-2026, 19:14:31 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (0.68)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.34)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Machine Learning > Statistical Learning (0.47)

Duplicate Docs Excel Report

Title
1673a54332b2afc905722048c26f5a4c-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found