Doubly-Robust Self-Training

May-27-2025, 03:49:19 GMT–Neural Information Processing Systems

Self-training is a well-established technique in semi-supervised learning, which leverages unlabeled data by generating pseudo-labels and incorporating them with a limited labeled dataset for training. The effectiveness of self-training heavily relies on the accuracy of these pseudo-labels. In this paper, we introduce doubly-robust self-training, an innovative semi-supervised algorithm that provably balances between two extremes. When pseudo-labels are entirely incorrect, our method reduces to a training process solely using labeled data. Conversely, when pseudo-labels are completely accurate, our method transforms into a training process utilizing all pseudo-labeled data and labeled data, thus increasing the effective sample size.

dataset, doubly-robust self-training

Neural Information Processing Systems

May-27-2025, 03:49:19 GMT

Conferences Web Page

Add feedback

Country:
- Asia > Middle East > Jordan (0.10)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.65)