CLiMB: AContinualLearningBenchmark forVision-and-Language Tasks

Feb-11-2026, 16:13:02 GMT–Neural Information Processing Systems

This assumption means learning separate models for language-only, vision-only, and vision-language tasks, as opposed to a single "generalist" model that can handle all modalities or subsets of them [Reed et al., 2022]. Yet, existing work suggests that knowledge grounded in multiple modalities can benefit unimodal tasks [Desai and Johnson, 2021, Jin et al., 2022].

etal, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Feb-11-2026, 16:13:02 GMT

Conferences PDF

Add feedback

Country:
- Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Representation & Reasoning (0.68)
  - Natural Language (0.68)

Duplicate Docs Excel Report

Title
bd3611971089d466ab4ca96a20f7ab13-Paper-Datasets_and_Benchmarks.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found