Teacher Demonstrations in a BabyLM's Zone of Proximal Development for Contingent Multi-Turn Interaction
Salhan, Suchir, Gu, Hongyi, Rooein, Donya, Galvan-Sosa, Diana, Gaudeau, Gabrielle, Caines, Andrew, Yuan, Zheng, Buttery, Paula
–arXiv.org Artificial Intelligence
Multi-turn dialogues between a child and a caregiver are characterized by a property called contingency - that is, prompt, direct, and meaningful exchanges between interlocutors. We introduce ContingentChat, a teacher-student framework that benchmarks and improves multi-turn contingency in a BabyLM trained on 100M words. Using a novel alignment dataset for post-training, BabyLM generates responses that are more grammatical and cohesive. Experiments with adaptive teacher decoding strategies show limited additional gains. ContingentChat demonstrates the benefits of targeted post-training for dialogue quality and indicates that contingency remains a challenging goal for BabyLMs.
arXiv.org Artificial Intelligence
Oct-24-2025
- Country:
- Asia
- China (0.04)
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.14)
- Singapore (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Europe
- Austria > Vienna (0.14)
- France
- Grand Est > Bas-Rhin
- Strasbourg (0.04)
- Provence-Alpes-Côte d'Azur > Bouches-du-Rhône
- Marseille (0.04)
- Grand Est > Bas-Rhin
- Germany > Berlin (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.14)
- North America > United States
- California > Santa Clara County
- Stanford (0.04)
- Florida > Miami-Dade County
- Miami (0.14)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Washington > King County
- Seattle (0.04)
- California > Santa Clara County
- Asia
- Genre:
- Research Report (0.81)
- Industry:
- Education (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science (0.93)
- Machine Learning > Neural Networks (0.67)
- Natural Language
- Chatbot (0.93)
- Large Language Model (0.94)
- Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence