SEAM: Semantically Equivalent Across Modalities Benchmark for Vision-Language Models

Open in new window