TART: Improved Few-shot Text Classification Using Task-Adaptive Reference Transformation

Lei, Shuo, Zhang, Xuchao, He, Jianfeng, Chen, Fanglan, Lu, Chang-Tien

Jun-3-2023–arXiv.org Artificial Intelligence

Meta-learning has emerged as a trending technique to tackle few-shot text classification and achieve state-of-the-art performance. However, the performance of existing approaches heavily depends on the inter-class variance of the support set. As a result, it can perform well on tasks when the semantics of sampled classes are distinct while failing to differentiate classes with similar semantics. In this paper, we propose a novel Task-Adaptive Reference Transformation (TART) network, aiming to enhance the generalization by transforming the class prototypes to per-class fixed reference points in task-adaptive metric spaces. To further maximize divergence between transformed prototypes in task-adaptive metric spaces, TART introduces a discriminative reference regularization among transformed prototypes. Extensive experiments are conducted on four benchmark datasets and our method demonstrates clear superiority over the state-of-the-art models in all the datasets. In particular, our model surpasses the state-of-the-art method by 7.4% and 5.4% in 1-shot and 5-shot classification on the 20 Newsgroups dataset, respectively.

classification, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

Jun-3-2023

arXiv.org PDF

Add feedback

Country:
- North America
  - United States
    - Virginia > Falls Church (0.04)
    - Illinois (0.04)
    - Washington > King County
      - Redmond (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Asia
  - Myanmar (0.04)
  - Middle East > Israel (0.04)

Genre:
- Research Report > Promising Solution (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Text Classification (0.64)
  - Machine Learning > Neural Networks
    - Deep Learning (0.47)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found