Optimizing Prompts for Text-to-Image Generation

Jan-19-2025, 23:13:40 GMT–Neural Information Processing Systems

Well-designed prompts can guide text-to-image models to generate amazing images. However, the performant prompts are often model-specific and misaligned with user input. Instead of laborious human engineering, we propose prompt adaptation, a general framework that automatically adapts original user input to model-preferred prompts. Specifically, we first perform supervised fine-tuning with a pretrained language model on a small collection of manually engineered prompts. Then we use reinforcement learning to explore better prompts. We define a reward function that encourages the policy to generate more aesthetically pleasing images while preserving the original user intentions.

optimizing prompt, reinforcement, text-to-image generation, (2 more...)

Neural Information Processing Systems

Jan-19-2025, 23:13:40 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Machine Learning
    - Neural Networks (0.56)
    - Reinforcement Learning (0.37)