Multiclass Online Learnability under Bandit Feedback

Raman, Ananth; Raman, Vinod; Subedi, Unique; Tewari, Ambuj

Sep-20-2023–arXiv.org Machine Learning

We study online multiclass classification under bandit feedback. We extend the results of Daniely and Helbertal [2013] by showing that the finiteness of the Bandit Littlestone dimension is necessary and sufficient for bandit online multiclass learnability even when the label space is unbounded. Moreover, we show that, unlike the full-information setting, sequential uniform convergence is necessary but not sufficient for bandit online learnability. Our result complements the recent work by Hanneke, Moran, Raman, Subedi, and Tewari [2023] who show that the Littlestone dimension characterizes online multiclass learnability in the full-information setting even when the label space is unbounded.

artificial intelligence, learnability, machine learning
