Best Prompts for Text-to-Image Models and How to Find Them
Pavlichenko, Nikita, Ustalov, Dmitry
–arXiv.org Artificial Intelligence
Recent progress in generative models, especially in text-guided diffusion models, has enabled the production of aesthetically-pleasing imagery resembling the works of professional human artists. However, one has to carefully compose the textual description, called the prompt, and augment it with a set of clarifying keywords. Since aesthetics are challenging to evaluate computationally, human feedback is needed to determine the optimal prompt formulation and keyword combination. In this paper, we present a human-in-the-loop approach to learning the most useful combination of prompt keywords using a genetic algorithm. We also show how such an approach can improve the aesthetic appeal of images depicting the same descriptions.
arXiv.org Artificial Intelligence
Jun-1-2023
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America
- United States > Louisiana
- Orleans Parish > New Orleans (0.04)
- Canada > Quebec
- Montreal (0.04)
- United States > Louisiana
- Europe
- Serbia > Central Serbia
- Belgrade (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- France > Hauts-de-France
- Serbia > Central Serbia
- Asia
- South Korea > Seoul
- Seoul (0.04)
- Japan > Honshū
- Kantō
- Tokyo Metropolis Prefecture > Tokyo (0.04)
- Kanagawa Prefecture > Yokohama (0.04)
- Kansai > Kyoto Prefecture
- Kyoto (0.04)
- Kantō
- South Korea > Seoul
- Oceania > Australia
- Genre:
- Research Report (0.64)
- Technology:
- Information Technology > Artificial Intelligence
- Vision (1.00)
- Natural Language (1.00)
- Representation & Reasoning > Optimization (0.46)
- Machine Learning
- Evolutionary Systems (0.36)
- Neural Networks > Deep Learning (0.30)
- Information Technology > Artificial Intelligence