GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI

Mar-22-2026, 00:30:18 GMT–Neural Information Processing Systems

Large Vision-Language Models (LVLMs) are capable of handling diverse data types such as imaging, text, and physiological signals, and can be applied in various fields. In the medical field, LVLMs have a high potential to offer substantial assistance for diagnosis and treatment. Before that, it is crucial to develop benchmarks to evaluate LVLMs' effectiveness in various medical applications. Current benchmarks are often built upon specific academic literature, mainly focusing on a single domain, and lacking varying perceptual granularities. Thus, they face specific challenges, including limited clinical relevance, incomplete evaluations, and insufficient guidance for interactive LVLMs. To address these limitations, we developed the GMAI-MMBench, the most comprehensive general medical AI benchmark with well-categorized data structure and multi-perceptual granularity to date.

artificial intelligence, natural language, proceedings, (10 more...)

Neural Information Processing Systems

Mar-22-2026, 00:30:18 GMT

Conferences Web Page

Add feedback

Industry:
- Health & Medicine (1.00)

Technology:
- Information Technology > Artificial Intelligence > Natural Language (0.57)