Holistic Evaluation of Text-to-Image Models
Neural Information Processing Systems
The stunning qualitative improvement of text-to-image models has led to widespread attention and adoption. However, we lack a comprehensive quantitative understanding of their capabilities and risks. To fill this gap, we introduce a new benchmark, Holistic Evaluation of Text-to-Image Models (HEIM). Whereas previous evaluations focus mostly on text-image alignment and image quality, we identify 12 aspects: text-image alignment, image quality, aesthetics, originality, reasoning, knowledge, bias, toxicity, fairness, robustness, multilinguality, and efficiency. We curate 62 scenarios encompassing these aspects and evaluate 26 state-of-the-art text-to-image models on this benchmark.
Jan-20-2025, 00:26:55 GMT
- Country:
- North America > United States > California > Santa Clara County > Palo Alto (0.09)
- Technology:
- Information Technology > Artificial Intelligence > Vision (1.00)