Using LLMs as prompt modifier to avoid biases in AI image generators

Apr-16-2025–arXiv.org Artificial Intelligence

This study examines how Large Language Models (LLMs) can reduce biases in text-to-image generation systems by modifying user prompts. We define bias as a model's unfair deviation from population statistics given neutral prompts. Our experiments with Stable Diffusion XL, 3.5 and Flux demonstrate that LLM-modified prompts significantly increase image diversity and reduce bias without the need to change the image generators themselves. While occasionally producing results that diverge from original user intent for elaborate prompts, this approach generally provides more varied interpretations of underspecified requests rather than superficial variations. The method works particularly well for less advanced image generators, though limitations persist for certain contexts like disability representation. All prompts and generated images are available at https://iisys-hof.github.io/llm-prompt-img-gen/

large language model, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

Apr-16-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.46)
- Europe > Germany (0.29)

Genre:
- Research Report (0.70)

Industry:
- Government > Military (0.47)
- Health & Medicine (0.47)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found