Near-Optimal Policies for Dynamic Multinomial Logit Assortment Selection Models

Dec-31-2018–Neural Information Processing Systems

In this paper we consider the dynamic assortment selection problem under an uncapacitated multinomial-logit (MNL) model. By carefully analyzing a revenue potential function, we show that a trisection-based algorithm achieves an item-independent regret bound of O(sqrt(T log log T)), which matches information-theoretic lower bounds up to iterated logarithmic terms. Our proof technique draws tools from the unimodal/convex bandit literature as well as adaptive confidence parameters in minimax multi-armed bandit problems.
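To make the trisection idea concrete, here is a minimal noise-free sketch in Python. It is not the paper's algorithm: the paper works with noisy purchase feedback and adaptive confidence parameters, whereas this sketch assumes the MNL preference weights are known exactly. It relies on the standard MNL expected-revenue formula with the no-purchase weight normalized to 1, and on the fact that, for items sorted by decreasing revenue, the expected revenue of offering the top k items is unimodal in k. The names `expected_revenue` and `trisect_best_prefix` and all numbers are hypothetical illustrations.

```python
from typing import Sequence


def expected_revenue(revenues: Sequence[float], weights: Sequence[float], k: int) -> float:
    """Expected MNL revenue of offering the first k items (no-purchase weight = 1)."""
    numerator = sum(r * v for r, v in zip(revenues[:k], weights[:k]))
    denominator = 1.0 + sum(weights[:k])
    return numerator / denominator


def trisect_best_prefix(revenues: Sequence[float], weights: Sequence[float]) -> int:
    """Find the best prefix size k by trisection.

    Assumes items are sorted by decreasing revenue, so that the
    expected revenue is unimodal in k. Noise-free illustration only.
    """
    lo, hi = 0, len(revenues)
    while hi - lo > 2:
        third = (hi - lo) // 3
        m1, m2 = lo + third, hi - third
        if expected_revenue(revenues, weights, m1) < expected_revenue(revenues, weights, m2):
            lo = m1 + 1  # m1 lies on the increasing side, so the peak is to its right
        else:
            hi = m2 - 1  # the peak lies strictly to the left of m2 (or m1 is already a peak)
    # At most three candidates remain; compare them directly.
    return max(range(lo, hi + 1), key=lambda k: expected_revenue(revenues, weights, k))


# Hypothetical data: revenues sorted in decreasing order, arbitrary positive weights.
revenues = [1.0, 0.8, 0.6, 0.4, 0.2]
weights = [0.5, 1.0, 1.0, 1.5, 2.0]
best_k = trisect_best_prefix(revenues, weights)
```

Each iteration discards roughly a third of the remaining candidates using two evaluations. In the bandit setting those evaluations would be replaced by estimates built from observed purchases, which is where confidence parameters come in.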

artificial intelligence, big data, data mining

Country:
- North America > United States (0.46)

Technology:
- Information Technology
  - Artificial Intelligence > Representation & Reasoning (1.00)
  - Data Science > Data Mining
    - Big Data (0.91)
