Explicit Learning and the LLM in Machine Translation

Marmonier, Malik, Bawden, Rachel, Sagot, Benoît

Mar-19-2025–arXiv.org Artificial Intelligence

This study explores the capacity of large language models (LLMs) for explicit learning, a process involving the assimilation of metalinguistic explanations to carry out language tasks. Using constructed languages generated by cryptographic means as controlled test environments, we designed experiments to assess an LLM's ability to explicitly learn and apply grammar rules. Our results demonstrate that while LLMs possess a measurable capacity for explicit learning, this ability diminishes as the complexity of the linguistic phenomena at hand increases. Supervised fine-tuning on chains of thought significantly enhances LLM performance but struggles to generalize to typologically novel or more complex linguistic features. These findings point to the need for more diverse training sets and alternative fine-tuning strategies to further improve explicit learning by LLMs.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

Mar-19-2025

arXiv.org PDF

Add feedback

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America
  - Dominican Republic (0.04)
  - United States > Florida
    - Miami-Dade County > Miami (0.04)
  - Mexico > Mexico City
    - Mexico City (0.04)
- Europe
  - United Kingdom > England
    - Oxfordshire > Oxford (0.04)
    - Cambridgeshire > Cambridge (0.04)
  - Middle East > Cyprus
    - Nicosia > Nicosia (0.04)
  - France > Île-de-France
    - Paris > Paris (0.04)
  - Bulgaria > Sofia City Province
    - Sofia (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - Singapore (0.04)
  - Thailand > Bangkok
    - Bangkok (0.04)
  - Myanmar > Tanintharyi Region
    - Dawei (0.04)
  - Japan > Honshū
    - Tōhoku > Iwate Prefecture
      - Morioka (0.04)
    - Chūbu > Toyama Prefecture
      - Toyama (0.04)

Genre:
- Research Report > New Finding (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found