Censored Semi-Bandits: A Framework for Resource Allocation with Censored Feedback
Arun Verma, Manjesh Hanawal, Arun Rajkumar, Raman Sankaran
–Neural Information Processing Systems
The problem is challenging because the loss distribution and threshold value of each arm are unknown. We study this novel setting by establishing its'equivalence' to Multiple-Play Multi-Armed Bandits (MP-MAB) andCombinatorial Semi-Bandits.
Neural Information Processing Systems
Feb-11-2026, 23:48:44 GMT