Holistic Evaluation of Text-to-Image Models

May-27-2025, 12:51:02 GMT–Neural Information Processing Systems

The stunning qualitative improvement of text-to-image models has led to their widespread attention and adoption. However, we lack a comprehensive quantitative understanding of their capabilities and risks. To fill this gap, we introduce a new benchmark, Holistic Evaluation of Text-to-Image Models (HEIM). Whereas previous evaluations focus mostly on image-text alignment and image quality, we identify 12 aspects, including text-image alignment, image quality, aesthetics, originality, reasoning, knowledge, bias, toxicity, fairness, robustness, multilinguality, and efficiency. We curate 62 scenarios encompassing these aspects and evaluate 26 state-of-the-art text-to-image models on this benchmark.

artificial intelligence, holistic evaluation, text-to-image model, (2 more...)

Neural Information Processing Systems

May-27-2025, 12:51:02 GMT

Conferences Web Page

Add feedback

Country:
- North America > United States > California > Santa Clara County > Palo Alto (0.09)

Technology:
- Information Technology > Artificial Intelligence > Vision (1.00)