MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs
–Neural Information Processing Systems
Existing LVLM benchmarks primarily focus on single-choice questions or short-form responses, which do not adequately assess the capabilities of LVLMs in real-world human-AI interaction applications.
Feb-8-2026, 01:15:40 GMT