Cascading Bandits: Optimizing Recommendation Frequency in Delayed Feedback Environments

Feb-11-2025, 17:04:11 GMT–Neural Information Processing Systems

Delayed feedback is a critical problem in dynamic recommender systems. In practice, the feedback result often depends on the frequency of recommendation. Most existing online learning literature fails to consider optimization of the recommendation frequency, and regards the reward from each successfully recommended message as equal. In this paper, we consider a novel cascading bandits setting, where individual messages from a selected list are sent to a user periodically. Whenever a user does not like a message, she may abandon the system with a probability positively correlated with the recommendation frequency.

algorithm, artificial intelligence, machine learning, (16 more...)

Neural Information Processing Systems

Feb-11-2025, 17:04:11 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (0.46)

Industry:
- Education (0.69)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Representation & Reasoning > Personal Assistant Systems (0.88)

Duplicate Docs Excel Report

Title
f95606d8e870020085990d9650b4f2a1-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found