Bandit-Feedback Online Multiclass Classification: Variants and Tradeoffs
Neural Information Processing Systems
Consider the domain of multiclass classification within the adversarial online setting. What is the price of relying on bandit feedback as opposed to full information? To what extent can an adaptive adversary amplify the loss compared to an oblivious one? To what extent can a randomized learner reduce the loss compared to a deterministic one?
Oct-10-2025, 20:14:15 GMT
- Country:
- Asia > Middle East
- Israel (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States (0.14)
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Education > Educational Setting > Online (1.00)
- Technology: