Towards Better Dental AI: AMultimodal Benchmark and Instruction Dataset for Panoramic X-ray Analysis
–Neural Information Processing Systems
Recent advances in large vision-language models (LVLMs) have demonstrated strong performance on general-purpose medical tasks. However, their effectiveness in specialized domains such as dentistry remains underexplored. In particular, panoramic X-rays, a widely used imaging modality in oral radiology, pose interpretative challenges due to dense anatomical structures and subtle pathological cues, which are not captured by existing medical benchmarks or instruction datasets. To this end, we introduce MMOral, the first large-scale multimodal instruction dataset and benchmark tailored for panoramic X-ray interpretation. MMOral consists of 20,563 annotated images paired with 1.3 million instruction-following instances across diverse task types, including attribute extraction, report generation, visual question answering, and image-grounded dialogue.
Neural Information Processing Systems
Jun-16-2026, 08:16:42 GMT
- Country:
- Asia (1.00)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.92)
- Research Report
- Industry:
- Health & Medicine
- Therapeutic Area > Dental and Oral Health (1.00)
- Nuclear Medicine (1.00)
- Diagnostic Medicine > Imaging (1.00)
- Health & Medicine
- Technology: