A multimodal developmental benchmark for language learning

Neural Information Processing Systems 

How (dis)similar are the learning trajectories of vision-language models and children? Recent modeling work has attempted to understand the gap between models' and humans' data efficiency by constructing models trained on less data, especially multimodal naturalistic data. However, such models are often evaluated on adult-level benchmarks, with limited breadth in the language abilities tested, and without direct comparison to behavioral data.