A multimodal developmental benchmark for language learning
–Neural Information Processing Systems
How (dis)similar are the learning trajectories of vision-language models and children? Recent modeling work has attempted to understand the gap between models' and humans' data efficiency by constructing models trained on less data, especially multimodal naturalistic data. However, such models are often evaluated on adultlevel benchmarks, with limited breadth in language abilities tested, and without direct comparison to behavioral data.
Neural Information Processing Systems
May-31-2025, 07:31:29 GMT
- Country:
- North America > United States (0.68)
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science (1.00)
- Machine Learning > Pattern Recognition (0.46)
- Natural Language > Text Processing (0.47)
- Representation & Reasoning (0.93)
- Vision (1.00)
- Information Technology > Artificial Intelligence