GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI
Pengcheng Chen 1,2, Jin Ye 1,3, Guoan Wang 1,4, Yanjun Li 1,4

Neural Information Processing Systems 

Large Vision-Language Models (LVLMs) can handle diverse data types such as imaging, text, and physiological signals, and can be applied across many fields. In medicine, LVLMs have high potential to offer substantial assistance for diagnosis and treatment. Before such deployment, however, it is crucial to develop benchmarks that evaluate LVLMs' effectiveness in various medical applications. Current benchmarks are often built upon specific academic literature, mainly focus on a single domain, and lack varying perceptual granularities.
