Curricular Transfer Learning for Sentence Encoded Tasks

de Sá, Jader Martins Camboim, Sanches, Matheus Ferraroni, de Souza, Rafael Roque, Reis, Júlio Cesar dos, Villas, Leandro Aparecido

arXiv.org Artificial Intelligence 

Fine-tuning language models in a downstream task is the standard approach for many state-of-the-art methodologies in the field of NLP. However, when the distribution between the source task and target task drifts, \textit{e.g.}, conversational environments, these gains tend to be diminished. This article proposes a sequence of pre-training steps (a curriculum) guided by "data hacking" and grammar analysis that allows further gradual adaptation between pre-training distributions. In our experiments, we acquire a considerable improvement from our method compared to other known pre-training approaches for the MultiWoZ task.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found