MedSG-Bench: A Benchmark for Medical Image Sequences Grounding

Jun-18-2026, 04:36:50 GMT–Neural Information Processing Systems

Visual grounding is essential for precise perception and reasoning in multimodal large language models (MLLMs), especially in medical imaging domains. While existing medical visual grounding benchmarks primarily focus on single-image scenarios, real-world clinical applications often involve sequential images, where accurate lesion localization across different modalities and temporal tracking of disease progression (e.g., pre-vs.

arxiv preprint arxiv, large language model, machine learning

Country:
- Europe (0.92)
- Asia > China (0.28)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Health & Medicine
  - Health Care Technology (1.00)
  - Diagnostic Medicine > Imaging (1.00)
  - Therapeutic Area
    - Oncology (1.00)
    - Cardiology/Vascular Diseases (1.00)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (1.00)
  - Artificial Intelligence
    - Vision (1.00)
    - Natural Language
      - Large Language Model (1.00)
      - Chatbot (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)
