Sequential Compositional Generalization in Multimodal Models

Open in new window