Explicit Learning and the LLM in Machine Translation
Marmonier, Malik, Bawden, Rachel, Sagot, Benoît
–arXiv.org Artificial Intelligence
This study explores the capacity of large language models (LLMs) for explicit learning, a process involving the assimilation of metalinguistic explanations to carry out language tasks. Using constructed languages generated by cryptographic means as controlled test environments, we designed experiments to assess an LLM's ability to explicitly learn and apply grammar rules. Our results demonstrate that while LLMs possess a measurable capacity for explicit learning, this ability diminishes as the complexity of the linguistic phenomena at hand increases. Supervised fine-tuning on chains of thought significantly enhances LLM performance but struggles to generalize to typologically novel or more complex linguistic features. These findings point to the need for more diverse training sets and alternative fine-tuning strategies to further improve explicit learning by LLMs.
arXiv.org Artificial Intelligence
Mar-19-2025
- Country:
- South America > Chile
- North America
- Dominican Republic (0.04)
- United States > Florida
- Miami-Dade County > Miami (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- Europe
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Cambridgeshire > Cambridge (0.04)
- Middle East > Cyprus
- France > Île-de-France
- Bulgaria > Sofia City Province
- Sofia (0.04)
- United Kingdom > England
- Asia
- Middle East > Jordan (0.04)
- Singapore (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Japan > Honshū
- Tōhoku > Iwate Prefecture
- Morioka (0.04)
- Chūbu > Toyama Prefecture
- Toyama (0.04)
- Tōhoku > Iwate Prefecture
- Genre:
- Research Report > New Finding (1.00)
- Technology: