Bandit Multiclass Linear Classification: Efficient Algorithms for the Separable Case

Beygelzimer, Alina, Pál, Dávid, Szörényi, Balázs, Thiruvenkatachari, Devanathan, Wei, Chen-Yu, Zhang, Chicheng

Feb-6-2019–arXiv.org Machine Learning

We study the problem of efficient online multiclass linear classification with bandit feedback, where all examples belong to one of $K$ classes and lie in the $d$-dimensional Euclidean space. Previous works have left open the challenge of designing efficient algorithms with finite mistake bounds when the data is linearly separable by a margin $\gamma$. In this work, we take a first step towards this problem. We consider two notions of linear separability, \emph{strong} and \emph{weak}. 1. Under the strong linear separability condition, we design an efficient algorithm that achieves a near-optimal mistake bound of $O\left( K/\gamma^2 \right)$. 2. Under the more challenging weak linear separability condition, we design an efficient algorithm with a mistake bound of $\min (2^{\widetilde{O}(K \log^2 (1/\gamma))}, 2^{\widetilde{O}(\sqrt{1/\gamma} \log K)})$. Our algorithm is based on kernel Perceptron, which is inspired by the work of \citet{Klivans-Servedio-2008} on improperly learning intersection of halfspaces.

algorithm, linearly separable, weakly linearly, (15 more...)

arXiv.org Machine Learning

Feb-6-2019

arXiv.org PDF

Add feedback

Country:
- Asia > China (0.04)
- North America > United States
  - New York > New York County
    - New York City (0.04)
  - California > Los Angeles County
    - Los Angeles (0.28)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report (0.64)

Industry:
- Education > Educational Setting > Online (0.46)

Technology:
- Information Technology
  - Data Science > Data Mining (0.93)
  - Artificial Intelligence > Machine Learning
    - Statistical Learning (1.00)
    - Neural Networks > Perceptrons (0.34)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found