MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
