Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation

Oct-10-2024, 00:23:50 GMT–Neural Information Processing Systems

Large pretrained language models (LMs) like BERT have improved performance in many disparate natural language processing (NLP) tasks. However, fine tuning such models requires a large number of training examples for each target task. Simultaneously, many realistic NLP problems are "few shot", without a sufficiently large training set. In this work, we propose a novel conditional neural process-based approach for few-shot text classification that learns to transfer from other diverse tasks with rich annotation. Our key idea is to represent each task using gradient information from a base model and to train an adaptation network that modulates a text classifier conditioned on the task representation.

grad2task, improved few-shot text classification, task representation, (2 more...)

Neural Information Processing Systems

Oct-10-2024, 00:23:50 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Text Classification (0.65)
  - Machine Learning > Inductive Learning (0.64)