MMDU: AMulti-TurnMulti-ImageDialog UnderstandingBenchmarkand Instruction-Tuning DatasetforLVLMs

Neural Information Processing Systems 

Existing LVLM benchmarks primarily focus onsingle-choice questions orshort-form responses, which donotadequately assess the capabilities ofLVLMs inreal-world human-AI interaction applications.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found