MMIG-Bench: Towards Comprehensive and Explainable Evaluation of Multi-Modal Image Generation Models
Neural Information Processing Systems
Recent multimodal image generators such as GPT-4o, Gemini 2.0 Flash, and Gemini 2.5 Pro excel at following complex instructions, editing images and maintaining concept consistency.
Jun-22-2026, 21:24:38 GMT