Uncertainty-aware Self-training for Few-shot Text Classification

Jan-15-2025, 11:42:45 GMT–Neural Information Processing Systems

Recent success of pre-trained language models crucially hinges on fine-tuning them on large amounts of labeled data for the downstream task, that are typically expensive to acquire or difficult to access for many applications. We study self-training as one of the earliest semi-supervised learning approaches to reduce the annotation bottleneck by making use of large-scale unlabeled data for the target task. Standard self-training mechanism randomly samples instances from the unlabeled pool to generate pseudo-labels and augment labeled data. We propose an approach to improve self-training by incorporating uncertainty estimates of the underlying neural network leveraging recent advances in Bayesian deep learning. Specifically, we propose (i) acquisition functions to select instances from the unlabeled pool leveraging Monte Carlo (MC) Dropout, and (ii) learning mechanism leveraging model confidence for self-training.

few-shot text classification, uncertainty-aware self-training, unlabeled pool, (1 more...)

Neural Information Processing Systems

Jan-15-2025, 11:42:45 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Text Classification (0.45)
  - Machine Learning
    - Unsupervised or Indirectly Supervised Learning (0.63)
    - Neural Networks (0.63)