Best Prompts for Text-to-Image Models and How to Find Them

Jun-1-2023–arXiv.org Artificial Intelligence

Recent progress in generative models, especially in text-guided diffusion models, has enabled the production of aesthetically-pleasing imagery resembling the works of professional human artists. However, one has to carefully compose the textual description, called the prompt, and augment it with a set of clarifying keywords. Since aesthetics are challenging to evaluate computationally, human feedback is needed to determine the optimal prompt formulation and keyword combination. In this paper, we present a human-in-the-loop approach to learning the most useful combination of prompt keywords using a genetic algorithm. We also show how such an approach can improve the aesthetic appeal of images depicting the same descriptions.

evolutionary algorithm, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

Jun-1-2023

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - New South Wales > Sydney (0.04)
- North America
  - United States > Louisiana
    - Orleans Parish > New Orleans (0.04)
  - Canada > Quebec
    - Montreal (0.04)
- Europe
  - Serbia > Central Serbia
    - Belgrade (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - France > Hauts-de-France
    - Nord > Lille (0.04)
- Asia
  - South Korea > Seoul
    - Seoul (0.04)
  - Japan > Honshū
    - Kantō
      - Tokyo Metropolis Prefecture > Tokyo (0.04)
      - Kanagawa Prefecture > Yokohama (0.04)
    - Kansai > Kyoto Prefecture
      - Kyoto (0.04)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Natural Language (1.00)
  - Representation & Reasoning > Optimization (0.46)
  - Machine Learning
    - Evolutionary Systems (0.36)
    - Neural Networks > Deep Learning (0.30)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found