Multi-User Contextual Cascading Bandits for Personalized Recommendation

Aug-26-2025–arXiv.org Machine Learning

We introduce a Multi-User Contextual Cascading Bandit model, a new combinatorial bandit framework that captures realistic online advertising scenarios where multiple users interact with sequentially displayed items simultaneously. Unlike classical contextual bandits, MCCB integrates three key structural elements: (i) cascading feedback based on sequential arm exposure, (ii) parallel context sessions enabling selective exploration, and (iii) heterogeneous arm-level rewards. We first propose Upper Confidence Bound with Backward Planning (UCBBP), a UCB-style algorithm tailored to this setting, and prove that it achieves a regret bound of $\widetilde{O}(\sqrt{THN})$ over $T$ episodes, $H$ session steps, and $N$ contexts per episode. Motivated by the fact that many users interact with the system simultaneously, we introduce a second algorithm, termed Active Upper Confidence Bound with Backward Planning (AUCBBP), which shows a strict efficiency improvement in context scaling, i.e., user scaling, with a regret bound of $\widetilde{O}(\sqrt{T+HN})$. We validate our theoretical findings via numerical experiments, demonstrating the empirical effectiveness of both algorithms under various settings.

artificial intelligence, data mining, machine learning, (14 more...)

arXiv.org Machine Learning

Aug-26-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Virginia > Arlington County
    - Arlington (0.04)
  - New York > New York County
    - New York City (0.04)
  - Georgia > Fulton County
    - Atlanta (0.04)
  - Florida > Broward County
    - Fort Lauderdale (0.04)
  - California > Alameda County
    - Berkeley (0.04)
- Europe
  - United Kingdom > Scotland
    - City of Edinburgh > Edinburgh (0.04)
  - France > Hauts-de-France
    - Nord > Lille (0.04)

Genre:
- Research Report (0.50)

Industry:
- Marketing (1.00)

Technology:
- Information Technology
  - Data Science > Data Mining (1.00)
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning > Personal Assistant Systems (0.84)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found