M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models

Neural Information Processing Systems 

Multimodal LLMs also perform poorly with complex multimodal questions.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found