Domain Adversarial Fine-Tuning as an Effective Regularizer

Giorgos Vernikos, Katerina Margatina, Alexandra Chronopoulou, Ion Androutsopoulos

arXiv.org, Machine Learning

In Natural Language Processing (NLP), pretrained language models (LMs) transferred to downstream tasks have recently been shown to achieve state-of-the-art results. However, standard fine-tuning can degrade the general-domain representations captured during pretraining. To address this issue, we introduce a new regularization technique: AFTER, domain Adversarial Fine-Tuning as an Effective Regularizer. Specifically, we complement the task-specific loss used during fine-tuning with an adversarial objective. This additional loss term corresponds to an adversarial classifier that aims to discriminate between in-domain and out-of-domain text representations; in-domain refers to the labeled dataset of the task at hand, while out-of-domain refers to unlabeled data from a different domain. Intuitively, the adversarial classifier acts as a regularizer that prevents the model from overfitting to the task-specific domain. Empirical results on various natural language understanding tasks show that AFTER leads to improved performance compared to standard fine-tuning.
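To make the setup concrete, below is a minimal sketch of one common way to implement such a domain-adversarial objective: a gradient reversal layer (Ganin & Lempitsky, 2015) feeding a binary in-domain/out-of-domain classifier. The abstract does not specify the exact adversarial mechanism AFTER uses, so treat this as an illustration, not the authors' implementation. The HuggingFace-style encoder interface (`.last_hidden_state`), the names `AfterClassifier` and `after_loss`, and the weighting factor `lambd` are all assumptions introduced here.

```python
import torch
import torch.nn as nn


class GradReverse(torch.autograd.Function):
    """Identity on the forward pass; negates (and scales) gradients on the backward pass."""

    @staticmethod
    def forward(ctx, x, lambd):
        ctx.lambd = lambd
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        # Gradient w.r.t. x is reversed; lambd receives no gradient.
        return -ctx.lambd * grad_output, None


class AfterClassifier(nn.Module):
    """Hypothetical task model with an auxiliary adversarial domain head."""

    def __init__(self, encoder, hidden_size, num_labels, lambd=1.0):
        super().__init__()
        self.encoder = encoder                        # pretrained LM, e.g. a BERT encoder
        self.task_head = nn.Linear(hidden_size, num_labels)
        self.domain_head = nn.Linear(hidden_size, 2)  # in-domain vs. out-of-domain
        self.lambd = lambd

    def forward(self, input_ids, attention_mask):
        # Use the first token's ([CLS]) representation as the sequence embedding.
        h = self.encoder(input_ids, attention_mask=attention_mask).last_hidden_state[:, 0]
        task_logits = self.task_head(h)
        # Gradient reversal makes the encoder *maximize* the domain classifier's loss,
        # pushing it toward representations that do not encode the domain.
        domain_logits = self.domain_head(GradReverse.apply(h, self.lambd))
        return task_logits, domain_logits


def after_loss(task_logits, labels, domain_logits, domain_labels, in_domain_mask):
    """Combined objective on a batch mixing in-domain and out-of-domain text.

    `labels` may hold dummy values at out-of-domain positions; they are masked out,
    since the task loss uses only the labeled in-domain examples, while the
    adversarial domain loss uses every example in the batch.
    """
    ce = nn.functional.cross_entropy
    task_loss = ce(task_logits[in_domain_mask], labels[in_domain_mask])
    adv_loss = ce(domain_logits, domain_labels)
    return task_loss + adv_loss
```

In this sketch, `lambd` controls how strongly the adversarial term regularizes the encoder relative to the task loss; in gradient-reversal setups it is often kept small or annealed upward during training so the adversarial signal does not swamp task learning early on.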
