A Practical Guide to Fine-tuning Language Models with Limited Data
Szép, Márton, Rueckert, Daniel, von Eisenhart-Rothe, Rüdiger, Hinterwimmer, Florian
–arXiv.org Artificial Intelligence
Employing pre-trained Large Language Models (LLMs) has become the de facto standard in Natural Language Processing (NLP) despite their extensive data requirements. Motivated by the recent surge in research focused on training LLMs with limited data, particularly in low-resource domains and languages, this paper surveys recent transfer learning approaches to optimize model performance in downstream tasks where data is scarce. We first address initial and continued pre-training strategies to better leverage prior knowledge in unseen domains and languages. We then examine how to maximize the utility of limited data during fine-tuning and few-shot learning. The final section takes a task-specific perspective, reviewing models and methods suited for different levels of data scarcity. Our goal is to provide practitioners with practical guidelines for overcoming the challenges posed by constrained data while also highlighting promising directions for future research.
arXiv.org Artificial Intelligence
Nov-14-2024
- Country:
- Oceania > Australia
- North America
- Dominican Republic (0.04)
- United States
- Washington > King County
- Seattle (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Washington > King County
- Canada
- Ontario > Toronto (0.05)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Europe
- Spain (0.04)
- Switzerland (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Italy > Tuscany
- Florence (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Romania > Sud - Muntenia Development Region
- Giurgiu County > Giurgiu (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- Middle East
- Malta > Eastern Region
- Northern Harbour District > St. Julian's (0.04)
- Cyprus > Nicosia
- Nicosia (0.04)
- Malta > Eastern Region
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Singapore (0.05)
- China > Hong Kong (0.04)
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Genre:
- Overview (1.00)
- Research Report (0.81)
- Industry:
- Health & Medicine (1.00)
- Education (0.67)
- Technology: