Online Semi-Supervised Learning with Bandit Feedback

Upadhyay, Sohini, Yurochkin, Mikhail, Agarwal, Mayank, Khazaeni, Yasaman, DjallelBouneffouf, null

Oct-23-2020–arXiv.org Machine Learning

We formulate a new problem at the intersectionof semi-supervised learning and contextual bandits,motivated by several applications including clini-cal trials and ad recommendations. We demonstratehow Graph Convolutional Network (GCN), a semi-supervised learning approach, can be adjusted tothe new problem formulation. We also propose avariant of the linear contextual bandit with semi-supervised missing rewards imputation. We thentake the best of both approaches to develop multi-GCN embedded contextual bandit. Our algorithmsare verified on several real world datasets.

bandit, inductive learning, neural network, (18 more...)

arXiv.org Machine Learning

Oct-23-2020

arXiv.org PDF

Add feedback

Country:
- Asia (0.46)
- Oceania > Australia
  - Victoria > Melbourne (0.14)

Genre:
- Overview (0.68)
- Research Report (0.64)

Industry:
- Health & Medicine (0.48)
- Information Technology (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Inductive Learning (0.82)
  - Neural Networks (1.00)
  - Statistical Learning (1.00)
  - Unsupervised or Indirectly Supervised Learning (0.91)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found