Exploiting the Surrogate Gap in Online Multiclass Classification

Oct-10-2024, 11:22:48 GMT–Neural Information Processing Systems

In the full information setting we provide expected mistake bounds for \textproc{Gaptron} with respect to the logistic loss, hinge loss, and the smooth hinge loss with O(K) regret, where the expectation is with respect to the learner's randomness and K is the number of classes. In the bandit classification setting we show that \textproc{Gaptron} is the first linear time algorithm with O(K\sqrt{T}) expected regret. Additionally, the expected mistake bound of \textproc{Gaptron} does not depend on the dimension of the feature vector, contrary to previous algorithms with O(K\sqrt{T}) regret in the bandit classification setting. We present a new proof technique that exploits the gap between the zero-one loss and surrogate losses rather than exploiting properties such as exp-concavity or mixability, which are traditionally used to prove logarithmic or constant regret bounds.

gaptron, online multiclass classification, textproc, (5 more...)

Neural Information Processing Systems

Oct-10-2024, 11:22:48 GMT

Conferences Web Page

Add feedback

Industry:
- Education > Educational Setting > Online (0.97)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)