Do Construction Distributions Shape Formal Language Learning In German BabyLMs?
Bunzeck, Bastian, Duran, Daniel, Zarrieß, Sina
–arXiv.org Artificial Intelligence
We analyze the influence of utterance-level construction distributions in German child-directed speech on the resulting formal linguistic competence and the underlying learning trajectories for small language models trained on a novel collection of developmentally plausible language data for German. We find that trajectories are surprisingly robust for markedly different distributions of constructions in the training data, which have little effect on final accuracies and almost no effect on global learning trajectories. While syntax learning benefits from more complex utterances, lexical learning culminates in better scores with more fragmentary data. We argue that LMs trained on developmentally plausible data can contribute to debates on how rich or impoverished linguistic stimuli actually are.
arXiv.org Artificial Intelligence
Mar-14-2025
- Country:
- North America
- United States
- Virginia (0.04)
- New Jersey > Bergen County
- Mahwah (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.14)
- Florida
- Miami-Dade County > Miami (0.04)
- Palm Beach County > Boca Raton (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- Europe
- Slovenia (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- Middle East
- Germany > Saxony
- Leipzig (0.04)
- Asia
- Singapore (0.04)
- Middle East
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.14)
- Republic of Türkiye > Istanbul Province
- Istanbul (0.04)
- Israel > Haifa District
- Haifa (0.04)
- UAE > Abu Dhabi Emirate
- North America
- Genre:
- Research Report (1.00)
- Industry:
- Education > Curriculum > Subject-Specific Education (0.41)
- Technology:
- Information Technology > Artificial Intelligence
- Natural Language (1.00)
- Machine Learning > Neural Networks (1.00)
- Cognitive Science (1.00)
- Representation & Reasoning (0.82)
- Information Technology > Artificial Intelligence