Beyond Overcorrection: Evaluating Diversity in T2I Models with DivBench
Friedrich, Felix, Welsch, Thiemo Ganesha, Brack, Manuel, Schramowski, Patrick, Kersting, Kristian
–arXiv.org Artificial Intelligence
Current diversification strategies for text-to-image (T2I) models often ignore contextual appropriateness, leading to over-diversification where demographic attributes are modified even when explicitly specified in prompts. This paper introduces DIVBENCH, a benchmark and evaluation framework for measuring both under- and over-diversification in T2I generation. Through systematic evaluation of state-of-the-art T2I models, we find that while most models exhibit limited diversity, many diversification approaches overcorrect by inappropriately altering contextually-specified attributes. We demonstrate that context-aware methods, particularly LLM-guided FairDiffusion and prompt rewriting, can already effectively address under-diversity while avoiding over-diversification, achieving a better balance between representation and semantic fidelity.
arXiv.org Artificial Intelligence
Jul-11-2025
- Country:
- Africa > Cameroon (0.04)
- Europe
- France (0.05)
- Germany > Hesse
- Darmstadt Region > Darmstadt (0.05)
- North America > United States (0.28)
- Genre:
- Research Report (0.82)
- Industry:
- Government (0.95)
- Technology: