II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models 1,3 Xi Feng

Open in new window