MedSG-Bench: A Benchmark for Medical Image Sequences Grounding
–Neural Information Processing Systems
Visual grounding is essential for precise perception and reasoning in multimodal large language models (MLLMs), especially in medical imaging domains. While existing medical visual grounding benchmarks primarily focus on single-image scenarios, real-world clinical applications often involve sequential images, where accurate lesion localization across different modalities and temporal tracking of disease progression (e.g., pre-vs.
Neural Information Processing Systems
Jun-12-2026, 15:51:01 GMT
- Industry:
- Health & Medicine > Diagnostic Medicine > Imaging (0.44)
- Technology:
- Information Technology > Artificial Intelligence
- Vision (0.89)
- Natural Language (0.59)
- Information Technology > Artificial Intelligence