Curricular Transfer Learning for Sentence Encoded Tasks
de Sá, Jader Martins Camboim, Sanches, Matheus Ferraroni, de Souza, Rafael Roque, Reis, Júlio Cesar dos, Villas, Leandro Aparecido
–arXiv.org Artificial Intelligence
Fine-tuning language models in a downstream task is the standard approach for many state-of-the-art methodologies in the field of NLP. However, when the distribution between the source task and target task drifts, \textit{e.g.}, conversational environments, these gains tend to be diminished. This article proposes a sequence of pre-training steps (a curriculum) guided by "data hacking" and grammar analysis that allows further gradual adaptation between pre-training distributions. In our experiments, we acquire a considerable improvement from our method compared to other known pre-training approaches for the MultiWoZ task.
arXiv.org Artificial Intelligence
Aug-3-2023
- Country:
- North America
- Dominican Republic (0.04)
- United States
- Washington > King County
- Seattle (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Washington > King County
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Spain > Galicia
- Madrid (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Italy > Tuscany
- Florence (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- United Kingdom > England
- Asia
- China (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- North America
- Genre:
- Research Report (1.00)
- Instructional Material > Course Syllabus & Notes (0.68)
- Industry:
- Education > Curriculum (0.68)
- Technology: